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<210> 1 
<211> 1570 
<212> DNA 

<213> Trichoderma reesei 
<400> 1 

atgtatcgga agttggccgt catctcggcc ttcttggcca cagctcgtgc tcagtcggcc 60 

tgcactctcc aatcggagac tcacccgcct ctgacatggc agaaatgctc gtctggtggc 120- 

acttgcactc aacagacagg ctccgtggtc atcgacgcca actggcgctg gactcacgct 180. 

acgaacagca gcacgaactg ctacgatggc aacacttgga gctcgaccct atgtcctgac 240 

aacgagacct gcgcgaagaa ctgctgtctg gacggtgccg cctacgcgtc cacgtacgga 300 

gttaccacga gcggtaacag cctctccatt ggctttgtca cccagtctgc gcagaagaac 360 

gttggcgctc gcctttacct tatggcgagc gacacgacct accaggaatt caccctgctt 420 

ggcaacgagt tctctttcga tgttgatgtt tcgcagctgc cgtaagtgac ttaccatgaa 480 

cccctgacgt atcttcttgt gggctcccag ctgactggcc aatttaaggt gcggcttgaa 540 

cggagctctc tacttcgtgt ccatggacgc ggatggtggc gtgagcaagt atcccaccaa 600 

caccgctggc gccaagtacg gcacggggta ctgtgacagc cagtgtcccc gcgatctgaa 660 

gttcatcaat ggccaggcca acgttgaggg ctgggagccg tcatccaaca acgcaaacac 720 

gggcattgga ggacacggaa gctgctgctc tgagatggat atctgggagg ccaactccat 780 

ctccgaggct cttacccccc acccttgcac gactgtcggc caggagatct gcgagggtga 840 

tgggtgcggc ggaacttact ccgataacag atatggcggc acttgcgatc ccgatggctg 900 

cgactggaac ccataccgcc tgggcaacac cagcttctac ggccctggct caagctttac 960 

cctcgatacc accaagaaat tgaccgttgt cacccagttc gagacgtcgg gtgccatcaa 1020 

ccgatactat gtccagaatg gcgtcacttt ccagcagccc aacgccgagc ttggtagtta 1080 

ctctggcaac gagctcaacg atgattactg cacagctgag gaggcagaat tcggcggatc 1140 

ctctttctca gacaagggcg gcctgactca gttcaagaag gctacctctg gcggcatggt 1200 

tctggtcatg agtctgtggg atgatgtgag tttgatggac aaacatgcgc gttgacaaag 1260 

agtcaagcag ctgactgaga tgttacagta ctacgccaac atgctgtggc tggactccac 1320 

ctacccgaca aacgagacct cctccacacc cggtgccgtg cgcggaagct gctccaccag 1380 

ctccggtgtc cctgctcagg tcgaatctca gtctcccaac gccaaggtca ccttctccaa 1440 

. catcaagttc ggacccattg gcagcaccgg caaccctagc ggcggcaacc ctcccggcgg 1500 

aaacccgcct ggcaccacca ccacccgccg cccagccact accactggaa gctctcccgg 1560 

acctactagt 1570 



<210> 2 



<211> 51 
<212> DNA 

<213> Trichoderma reesei 
<400> 2 

atgtatcgga agttggccgt catctcggcc ttcttggcca cagctcgtgc t 51 

<210> 3 
<211> 1438 
<212> DNA 

<213> Trichoderma reesei 
<400> 3 

cagtcggcct gcactctcca atcggagact cacccgcctc tgacatggca gaaatgctcg 60 

tctggtggca cttgcactca acagacaggc tccgtggtca tcgacgccaa ctggcgctgg 120 

actcacgcta cgaacagcag cacgaactgc tacgatggca acacttggag ctcgacccta 180 

tgtcctgaca acgagacctg cgcgaagaac tgctgtctgg acggtgccgc ctacgcgtcc 24 0 

acgtacggag ttaccacgag cggtaacagc ctctccattg gctttgtcac ccagtctgcg 3 00 

cagaagaacg ttggcgctcg cctttacctt atggcgagcg acacgaccta ccaggaattc 3 60 

accctgcttg gcaacgagtt ctctttcgat gttgatgttt cgcagctgcc gtaagtgact 420 

taccatgaac ccctgacgta tcttcttgtg ggctcccagc tgactggcca atttaaggtg 4 80 

cggcttgaac ggagctctct acttcgtgtc catggacgcg gatggtggcg tgagcaagta 540 

tcccaccaac accgctggcg ccaagtacgg cacggggtac tgtgacagcc agtgtccccg 600 

cgatctgaag ttcatcaatg gccaggccaa cgttgagggc tgggagccgt catccaacaa 660 

cgcaaacacg ggcattggag gacacggaag ctgctgctct gagatggata tctgggaggc 720 

caactccatc tccgaggctc ttacccccca cccttgcacg actgtcggcc aggagatctg 780 

cgagggtgat gggtgcggcg gaacttactc cgataacaga tatggcggca cttgcgatcc 84 0 

cgatggctgc gactggaacc cataccgcct gggcaacacc agcttctacg gccctggctc 900 

aagctttacc ctcgatacca ccaagaaatt gaccgttgtc acccagttcg agacgtcggg 960 

tgccatcaac cgatactatg tccagaatgg cgtcactttc cagcagccca acgccgagct 1020 

tggtagttac tctggcaacg agctcaacga tgattactgc acagctgagg aggcagaatt 1080 

cggcggatcc tctttctcag acaagggcgg cctgactcag ttcaagaagg ctacctctgg 1140 

cggcatggtt ctggtcatga gtctgtggga tgatgtgagt ttgatggaca aacatgcgcg 1200 

ttgacaaaga gtcaagcagc tgactgagat gttacagtac tacgccaaca tgctgtggct 1260 
ggactccacc tacccgacaa acgagacctc ctccacaccc ggtgccgtgc gcggaagctg * 1320 

ctccaccagc tccggtgtcc ctgctcaggt cgaatctcag tctcccaacg ccaaggtcac 1380 

cttctccaac atcaagttcg gacccattgg cagcaccggc aaccctagcg gcggcaac 1438 

<210> 4 
<211> 81 
<212> DNA 

<213> Trichoderma reesei 
<400> 4 

cctcccggcg gaaacccgcc tggcaccacc accacccgcc gcccagccac taccactgga 60 
agctctcccg gacctactag t 81 

<210> 5 
<211> 480 
<212> PRT 

<213> Trichoderma reesei 
<400> 5 

Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 

15 10 15 

Ala Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 



Trp 


Gin 


Lvs 


Cvs 


Ser 


Ser 


Gly 


•Gly 






35 










40 


Val 


Val 


He 


Asp 


Ala 


Asn 


Trp 


Arg 




50 










55 




Thr 


Asn 


Cvs 


Tyr 


Asp 


Gly 


Asn 


Thr 


65 










70 






Asn 


Glu 


Thr 


Cys 


Ala 


Lys 


Asn 


Cys 










85 








Ser 


Thr 


Tyr 


Gly Val 


Thr 


Thr 


Ser 








100 










Val 


Thr 


Gin 


Ser 


Ala 


Gin 


Lys 


Asn 






115 










120 


Ala 


Ser 


Asp 


Thr 


Thr 


Tyr 


Gin 


Glu 




130 










135 




Ser 


Phe 


Asp 


Val 


Asp 


Val 


Ser 


Gin 


145 










150 






Leu 


Tvr 


Phe 


Val 


Ser 


Met 


Asp 


Ala 










165 








Thr 


Asn 


Thr 


Ala 


Gly 


Ala 


Lys 


Tvr 








180 










Cys 


Pro 


Arg 


Asp 


Leu 


Lys 


Phe 


He 






195 










200 


Tro 

Br 


Glu 


Pro 


Ser 


Ser 


Asn 


Asn 


Ala 




210 










215 




Ser 


Cys 


Cys 


Ser 


Glu 


Met 


Asp 


He 


225 










230 






Ala 


Leu 


Thr 


Pro 


His 


Pro 


Cys 


Thr 










245 








Glv 


Asp 


Gly 


Cys 


Gly 


Glv 


Thr 


Tvr 








260 










Cvs 


Asp 


Pro 


Asp Gly 


Cvs 


Asp 


Tro 




Phe 


275 










280 


Ser 


Tyr Gly 


Pro 


Glv 


Ser 


Ser 




290 










295 




Leu 


Thr 


Val 


Val 


Thr 


Gin 


Phe 


Glu 


305 










310 






Tvr 


Val 


Gin. 


Asn 


Gly 


Val 


Thr 


Phe 










325 








Ser 


Tvr 


Ser 


Gly Asn 


Glu 


Leu 


Asn 








340 










Ala 


Glu 


Phe 


Gly Gly 


Ser 


Ser 


Phe 






355 










360 


Phe 


Lys 


Lys 


Ala 


Thr 


Ser 


Glv 


Glv 




370 










375 




Asp 


Asp 


Tyr 


Tyr 


Ala 


Asn 


Met 


Leu 


385 










390 






Asn 


pi,, 
o±U 


Thr 


Ser 


Ser 


1 11X 


rL U 












405 








Ser 


Ser 


Gly Val 


Pro 


Ala 


Gin 


Val 








420 










Val 


Thr 


Phe 


Ser 


Asn 


He 


Lys 


Phe 






435 










440 


Pro 


Ser 


Gly 


Gly 


Asn 


Pro 


Pro 


Gly 




450 










455 




Thr 


Arg 


Arg 


Pro 


Ala 


Thr 


Thr 


Thr 


465 










470 







25 










30 






Thr 


Cys 


Thr 


Gin 


Gin 


Thr Gly 


Ser 










45 








Trp 


Thr 


His 


Ala 


Thr 


Asn 


Ser 


Ser 








60 










Trp 


Ser 


Ser 


Thr 


Leu 


Cys 


Pro 


Asp 






75 










80 


Cys 


Leu 


Asp 


Gly 


Ala 


Ala 


Tyr 


Ala 




90 










95 




Gly Asn 


Ser 


Leu 


Ser 


He 


Gly 


Phe 


105 










110 






Val 


Gly 


Ala 


Arg 


Leu 


Tyr 


Leu 


Met 










125 








Phe 


Thr 


Leu 


Leu 


Gly Asn 


Glu 


Phe 








140 










Leu 


Pro 


Cys 


Gly 


Leu 


Asn 


Gly 


Ala 






155 










160 


Asp 


Gly 


Gly Val 


Ser 


Lys 


Tyr 


Pro 




170 










175 




Gly Thr 


Gly Tyr Cys 


Asp 


Ser 


Gin 


185 










190 






Asn Gly 


Gin 


Ala Asn 


Val 


Glu 


Gly 










205 








Asn 


Thr 


Gly 


He 


Gly Gly His 


Gly 








220 










Trp 


bill 


Ala 


Asn 


Ser 


He 


Ser 


Glu 






235 










240 


Thr 


Val 


Gly Gin Glu 


He 


Cys 


Glu 




250 










255 




Ser 


Asp 


Asn 


Arg 


Tyr 


Gly 


Gly 


Thr 


265 










270 






Asn 


Pro 


Tyr 


Arg 


Leu 


Gly Asn 


Thr 










285 








Phe 


Thr 


Leu 


Asp 


Thr 


Thr 


Lys 


Lys 








300 










Thr 


Ser 


Gly Ala 


He 


Asn 


Arg 


Tyr 






315 










320 


Gin 


Gin 


Pro 


Asn 


Ala 


Glu 


Leu 


Gly 




330 










335 




Asp 


Asp 


Tyr 


Cys 


Thr 


Ala 


Glu 


Glu 


345 










350 






Ser 


Asp 


Lys 


Gly Gly Leu 


Thr 


Gin 










365 








Met 


Val 


Leu 


Val 


Met 


Ser 


Leu 


Tro 








380 










Trp 


Leu 


Asp 


Ser 


Thr 


Tyr 


Pro 


Thr 






395 










400 


Ala 


Val 


Arg 


Gly 


Ser 


Cys 


Ser 


Thr 




410 










415 




Glu 


Ser 


Gin 


Ser 


Pro 


Asn 


Ala 


T \ra 

Jjyb 


425 










430 






Gly 


Pro 


He 


Gly 


Ser 


Thr 


Gly 


Asn 










445 








Gly 


Asn 


Pro 


Pro 


Gly 


Thr 


Thr 


Thr 








460 










Gly 


Ser 


Ser 


Pro 


Gly 


Pro 


Thr 


Ser 






475 










480 



<210> 6 

<211> 431 

<212> PRT 

<213> Trichoderma reesei 



<400> 6 



Gin 


Ser 


Ala 


Cys 


Thr 


Leu 


Gin 


Ser 


Glu 


Thr 


His 


Pro 


Pro 


Leu 


Thr 


Trp 


1 








5 










10 










15 




Gin 


Lys 


Cys 


Ser 


Ser 


Gly 


Gly 


Thr 


Cys 


Thr 


Gin 


Gin 


Thr 


Gly 


Ser 


Val 








20 










25 










30 






Val 


lie 


Asp 


Ala 


Asn 


Trp 


Arg 


Trp 


Thr 


His 


Ala 


Thr 


Asn 


Ser 


Ser 


Thr 






35 










40 










45 








Asn 


Cys 


Tvr 


Asp 


Gly 


Asn 


Thr 


Trp 


Ser 


Ser 


Thr 


Leu 


Cys 


Pro 


Asp 


Asn 




50 










55 










60 










Glu 


Thr 


Cvs 


Ala 


Lvs 


Asn 


Cys 


Cys 


Leu 


Asp 


Gly 


Ala 


Ala 


Tyr 


Ala 


Ser 


65 










70 










75 










80 


Thr 


Tvr 


Glv 


Val 


Thr 


Thr 


Ser Gly 


Asp 


Ser 


Leu 


Ser 


He 


Gly 


Phe 


Val 










85 










90 










95 




Thr 


Gin 


Ser 


Ala 


Gin 


Lvs 


Asn 


Val 


Gly Ala Arg Leu 


Tyr 


Leu 


Met 


Ala 








100 










105 










110 






Ser 


Asp 


Thr 


Thr 


Tvr 


Gin 


Glu 


Phe 


Thr 


Leu Leu Gly Asn 


Glu 


Phe 


Ser 






115 










120 










125 








Phe 


Asp 


Val 


Asp 


Val 


Ser 


Gin 


Leu 


Pro 


Cys 


Gly 


Leu 


Asn 


Gly 


Ala 


Leu 




130 










135 










140 


i 








Tvr 


Phe 


Val 


Ser 


Met 


Asp 


Ala 


Asp 


Gly Gly Val 


Ser 


Lys 


Tyr 


Pro 


Thr 


145 










150 










155 










160 


Asn 


Thr 


Ala 


Glv 


Ala 


Lys 


Tyr 


Gly 


Thr 


Gly Tyr Cys 


Asp 


Ser 


Gin 


Cys 










165 










170 










175 






Arg 


Asp 


Leu 


Lys 


Phe 


He 


Asn 


Gly Gin Ala 


Asn 


Val 


Glu 


Glv 


Trn 








180 










185 










190 






Glu 


Pro 


Ser 


Ser 


Asn 


Asn 


Ala 


Asn 


Thr 


Gly 


He 


Gly Gly 


His 


Gly 


Ser 






195 










200 










205 








Cys 


Cvs 


Ser 


Glu 


Met 


Asp 


He 


Trp 


Glu 


Ala 


Asn 


Ser 


He 


Ser 


Glu 


Ala 




210 










215 










220 










Leu 


Thr 


Pro 


His 


Pro 


Cvs 


Thr 


Thr 


Val 


Gly Gin Glu 


He 


Cys 


Glu 


Gly 


225 










230 










235 










240 


Asp 


Glv 


Cvs 


Glv 


Glv 


Thr 


Tyr 


Ser 


Asp 


Asn 


Arg 


Tyr 


Gly 


Gly 


Thr 


Cvs 










245 










250 










255 




Asp 


Pro 


Asp 


Glv 


Cvs 


Asp 


Trp 


Asn 


Pro 


Tyr 


Arg 


Leu 


Gly 


Asn 


Thr 


Ser 








260 










265 










270 






n ic 


T\/v 
i Y I 




Jrx \J 






Ser 


Phe 


Thr 


Leu 


Asp 


Thr 


Thr 


Lys 


Lys 


Leu 






275 










280 










285 








1 Ilx 


\7-3 1 
Val 


veil 


1 111. 




me 


Glu 


Thr 


Ser Gly Ala 


He 


Asn 


Arg 




i y x 




290 










295 










300 










Val 


Gin 


Asn 


Gly 


Val 


Thr 


Phe 


Gin 


Gin 


Pro 


Asn 


Ala 


Glu 


Leu 


Gly 


Ser 


305 










310 










315 










320 


Tyr 


Ser 


Gly 


Asn 


Glu 


Leu 


Asn 


Asp 


Asp 


Tyr 


Cys 


Thr 


Ala 


Glu 


Glu 


Ala 










325 










330 










335 




Glu 


Phe 


Gly 


Gly 


Ser 


Ser 


Phe 


Ser 


Asp 


Lys 


Gly Gly Leu 


Thr 


Gin 


Phe 








340 










345 










350 






Lys 


Lys 


Ala 


Thr 


Ser 


Gly 


Gly Met 


Val 


Leu 


Val 


Met 


Ser 


Leu 


Trp 


Asp 






355 










360 










365 








Asp 


Tyr 


Tyr 


Ala 


Asn 


Met 


Leu 


Trp 


Leu 


Asp 


Ser 


Thr 


Tyr 


Pro 


Thr 


Asn 




370 










375 










380 










Glu 


Thr 


Ser 


Ser 


Thr 


Pro 


Gly Ala 


Val 


Arg 


Gly 


Ser 


Cys 


Ser 


Thr 


Ser 


385 










390 










395 










400 



Ser Gly Val Pro 

Thr Phe Ser Asn 
420 



Ala Gin Val Glu 
405 

lie Lys Phe Gly 



Ser Gin Ser Pro 
410 

Pro lie Gly Ser 
425 



Asn Ala Lys Val 
415 

Thr Gly Asn 
430 



<210> 7 
<211> 1077 
<212> DNA 

<213> Acidothermus cellulolyticus 
<400> 7 

gcgggcggcg gctattggca cacgagcggc cgggagatcc tggacgcgaa caacgtgccg 1 60 

gtacggatcg ccggcatcaa ctggtttggg ttcgaaacct gcaattacgt cgtgcacggt 120 

ctctggtcac gcgactaccg cagcatgctc gaccagataa agtcgctcgg ctacaacaca 180 

atccggctgc cgtactctga cgacattctc aagccgggca ccatgccgaa cagcatcaat 240 

ttttaccaga tgaatcagga cctgcagggt ctgacgtcct tgcaggtcat ggacaaaatc 300 

gtcgcgtacg ccggtcagat cggcctgcgc atcattcttg accgccaccg accggattgc 360 

agcgggcagt cggcgctgtg gtacacgagc agcgtctcgg aggctacgtg gatttccgac 420 

ctgcaagcgc tggcgcagcg ctacaaggga aacccgacgg tcgtcggctt tgacttgcac 480 

aacgagccgc atgacccggc ctgctggggc tgcggcgatc cgagcatcga ctggcgattg 540 

gccgccgagc gggccggaaa cgccgtgctc tcggtgaatc cgaacctgct cattttcgtc 600 

gaaggtgtgc agagctacaa cggagactcc tactggtggg gcggcaacct gcaaggagcc 660 

ggccagtacc cggtcgtgct gaacgtgccg aaccgcctgg tgtactcggc gcacgactac 720 

gcgacgagcg tctacccgca gacgtggttc agcgatccga ccttccccaa caacatgccc 780 

ggcatctgga acaagaactg gggatacctc ttcaatcaga acattgcacc ggtatggctg 840 

ggcgaattcg gtacgacact gcaatccacg accgaccaga cgtggctgaa gacgctcgtc 900 

cagtacctac ggccgaccgc gcaatacggt gcggacagct tccagtggac cttctggtcc 960 

tggaaccccg attccggcga cacaggagga attctcaagg atgactggca gacggtcgac 1020 

acagtaaaag acggctatct cgcgccgatc aagtcgtcga ttttcgatcc tgtcggc 1077 



<210> 8 

<211> 359 

<212> PRT 

<213> Acidothermus cellulolyticus 



<400> 8 

Ala Gly Gly Gly 

1 

Asn Asn Val Pro 
20 

Thr Cys Asn Tyr 
35 

Met Leu Asp Gin 
50 

Tyr Ser Asp Asp 
65 

Phe Tyr Gin Met 

Met Asp Lys lie 
100 

Leu Asp Arg His 
115 

Thr Ser Ser Val 
130 

Ala Gin Arg Tyr 
145 

Asn Glu Pro His 



Tyr Trp His Thr 
5 

Val Arg He Ala 

Val Val His Gly 
40 

He Lys Ser Leu 
55 

He Leu Lys Pro 
70 

Asn Gin Asp Leu 
85 

Val Ala Tyr Ala 

Arg Pro Asp Cys 
120 

Ser Glu Ala Thr 
135 

Lys Gly Asn Pro 
150 

Asp Pro Ala Cys 



Ser Gly Arg Glu 
10 

Gly He Asn Trp 
25 

Leu Trp Ser Arg 

Gly Tyr Asn Thr 
60 

Gly Thr Met Pro 
75 

Gin Gly Leu Thr 
90 

Gly Gin He Gly 

105 . 

Ser Gly Gin Ser 

Trp lie Ser Asp 
140 

Thr Val Val Gly 
155 

Trp Gly Cys Gly 



He Leu Asp Ala 
15 

Phe Gly Phe Glu 
30 

Asp Tyr Arg Ser 
45 

He Arg Leu Pro 

Asn Ser lie Asn 
80 

Ser Leu Gin Val 
95 

Leu Arg He He 
110 

Ala Leu Trp Tyr 
125 

Leu Gin Ala Leu 

Phe Asp Leu His 
160 

Asp Pro Ser He 











165 










170 










175 




Asp 


Trp 


Arg 


Leu 


Ala 


Ala 


Glu 


Arg 


Ala Gly Asn 


Ala 


Val 


Leu 


Ser 


Val 








180 










185 










190 






Asn 


Pro 


Asn 


Leu 


Leu 


He 


Phe 


Val 


Glu 


Gly Val 


Gin 


Ser 


Tyr 


Asn 


Gly 






195 










200 










205 








Asp 


Ser 


Tyr 


Trp 


Trp 


Gly 


Gly 


Asn 


Leu 


Gin 


Gly 


Ala 


Gly 


Gin 


Tyr 


Pro 




210 










215 










220 










Val 


Val 


Leu 


Asn 


Val 


Pro 


Asn 


Arg 


Leu 


Val 


Tyr 


Ser 


Ala 


His 


Asp 


Tyr 


225 










230 










235 










240 


Ala 


Thr 


Ser 


Val 


Tyr 


Pro 


Gin 


Thr 


Trp 


Phe 


Ser 


Asp 


Pro 


Thr 


Phe 


Pro 










245 










250 










255 




Asn 


Asn 


Met 


Pro 


Gly 


He 


Trp 


Asn 


Lys 


Asn 


Trp 


Gly Tyr 


Leu 


Phe 


Asn 








260 










265 










270 






Gin 


Asn 


He 


Ala 


Pro 


Val 


Trp 


Leu 


Gly Glu 


Phe 


Gly Thr 


Thr 


Leu 


Gin 






275 










280 










285 








Ser 


Thr 


Thr 


Asp 


Gin 


Thr 


Trp 


Leu 


Lys 


Thr 


Leu 


Val 


Gin 


Tyr 


Leu 


Arg 




290 










295 










300 










Pro 


Thr 


Ala 


Gin 


Tyr 


Gly 


Ala 


Asp 


Ser 


Phe 


Gin 


Trp 


Thr 


Phe 


Trp 


Ser 


305 










310 










315 










320 


Trp 


Asn 


Pro 


Asp 


Ser 


Gly 


Asp 


Thr 


Gly Gly 


He 


Leu 


Lys 


Asp 


Asp 


Trp 










325 










330 










335 




Gin 


Thr 


Val 


Asp 


Thr 


Val 


Lys 


Asp 


Gly Tyr Leu 


Ala 


Pro 


He 


Lys 


Ser 








340 










345 










350 






Ser 


He 


Phe 


Asp 


Pro 


Val 


Gly 





















355 



<210> 9 
<211> 1914 
<212> DNA 

<213> Acidothermus cellulolyticus 
<400> 9 

aacgacccgt acatccagcg gttcctcacg atgtacaaca agattcacga cccagcgaac 60 

ggctacttca gcccgcaggg aattccctac cactcggtag aaacgctcat cgttgaggca 120 

ccggactacg ggcacgagac aacttcggag gcgtacagct tctggctctg gctcgaagcg 180 
acgtacggcg cagtgaccgg caactggacg ccgttcaaca acgcctggac gacgatggaa ' 240 

acgtacatga tcccgcagca cgcggaccag ccgaacaacg cgtcgtacaa ccccaacagc 3 00 

ccggcgtcgt acgctccgga agagccgctg cccagcatgt acccggttgc catcgacagc 360 

agcgtgccgg ttgggcacga cccgctcgcc gccgaattgc agtcgacgta cggcactccg 420 

gacatttacg gcatgcactg gctggccgac gttgacaaca tctacggata cggcgacagc 480 

cccggcggtg gttgcgaact cggtccttcc gctaagggcg tctcctacat caacacattc 540 

cagcgcggct cgcaggagtc cgtctgggag acggtcaccc agccgacgtg cgacaacggc 600 

aagtacggtg gggcgcacgg ctacgtcgac ctgttcatcc agggttcgac gccgccgcag 660 

tggaagtaca ccgatgcccc ggacgccgac gcccgtgccg tccaggctgc gtactgggcc 720 

tacacctggg catcggcgca gggcaaggca agcgcgattg ccccgacgat cgccaaggcg 780 

gccaaactcg gcgactacct gcggtactcg ctctttgaca agtacttcaa gcaggtcggc 840 

aactgctacc cggccagctc ctgccctgga gcaaccggac gccagagcga gacctacctg 900 

atcggctggt actacgcctg gggcggctca agccaaggct gggcctggcg cattggtgac 960 

ggcgccgcgc acttcggcta ccagaatccg cttgccgcgt gggcgatgtc gaacgtgaca 1020 

ccgctcattc cgctctcgcc cacggcaaag agcgactggg cggcgagctt gcagcgccag 1080 

ctggagttct accagtggtt gcaatccgcg gaaggagcca ttgcgggcgg cgccaccaac 1140 

agctggaacg gcaattacgg gaccccgccg gccggagact cgaccttcta cggcatggcg 1200 

tacgactggg agccggtcta ccacgacccg ccgagcaaca actggttcgg cttccaggcg 1260 

tggtccatgg aacgggttgc cgagtactac tacgtcaccg gcgacccgaa ggccaaggcg 1320 

ctgctcgaca agtgggtcgc atgggtgaag ccgaatgtca ccaccggtgc ctcatggtcg 1380 

attccgtcga atttgtcctg gagcggccaa ccggatacct ggaatccgag caacccagga 1440 

acgaatgcca acctgcacgt gaccatcacg tcgtccgggc aggacgtcgg tgttgccgcg 1500 



gcgctcgcga agacactcga gtactacgcg gcaaaatccg gcgatacggc ctcgcgcgac 1560 

ctcgcgaagg gattgctcga ctccatgtgg aacaacgacc aggacagcct cggtgtgagc 1620 

acaccggaga cgcggaccga ctactctcgg ttcactcagg tgtacgaccc gacgactggt 1680 

gacggcctct acatcccgt'c gggttggacg gggaccatgc ccaacggtga ccaaatcaag 1740 

ccgggtgcga ccttcctgag catccggtcc tggtacacca aggatccgca gtggtcgaag 1800 

gtgcaggcgt acctcaacgg cgggcctgct ccgacgttca actaccaccg gttctgggcg 1860 

gagtccgact tcgcgatggc gaacgccgat tttggcatgc tcttcccatc cggg 1914 

<210> 10 
<211> 638 
<212> PRT 

<213> Acidothermus cellulolyticus 
<400> 10 

Asn Asp Pro Tyr lie Gin Arg Phe Leu Thr Met Tyr Asn Lys lie His 

15 10 15 

Asp Pro Ala Asn Gly Tyr Phe ,Ser Pro Gin Gly lie Pro Tyr His Ser 

20 25 30 

Val Glu Thr Leu He Val Glu Ala Pro Asp Tyr Gly His Glu Thr Thr 

35 40 45 

Ser Glu Ala Tyr Ser Phe Trp Leu Trp Leu Glu Ala Thr Tyr Gly Ala 

50 55 60 , 

Val Thr Gly Asn Trp Thr Pro Phe Asn Asn Ala Trp Thr thr Met Glu 
65 70 75 80 

Thr Tyr Met He Pro Gin His Ala Asp Gin Pro Asn Asn Ala Ser Tyr 

85 90 95 

Asn Pro Asn Ser Pro Ala Ser Tyr Ala Pro Glu Glu Pro Leu Pro Ser 

100 105 110 

Met Tyr Pro Val Ala He Asp Ser Ser Val Pro Val Gly His Asp Pro 

115 120 125 

Leu Ala Ala Glu Leu Gin Ser Thr Tyr Gly Thr Pro Asp He Tyr Gly 

130 135 140 

Met His Trp Leu Ala Asp Val Asp Asn He Tyr Gly Tyr Gly Asp Ser 
145 150 155 160 

Pro Gly Gly Gly Cys Glu Leu Gly Pro Ser Ala Lys Gly Val Ser Tyr 

165 170 175 

lie Asn Thr Phe Gin Arg Gly Ser Gin Glu Ser Val Trp Glu Thr Val 

180 185 190 

Thr Gin Pro Thr Cys Asp Asn Gly Lys Tyr Gly Gly Ala His Gly Tyr 

195 200 205 

Val Asp Leu Phe He Gin Gly Ser Thr Pro Pro Gin Trp Lys Tyr Thr 

210 215 220 

Asp Ala Pro Asp Ala Asp Ala Arg Ala Val Gin Ala Ala Tyr Trp Ala 
225 230 235 240 

Tyr Thr Trp Ala Ser Ala Gin Gly Lys Ala Ser Ala lie Ala Pro Thr 

245 250 255 

He Ala Lys Ala Ala Lys Leu Gly Asp Tyr Leu Arg Tyr Ser Leu Phe 

260 265 270 

Asp Lys Tyr Phe Lys Gin Val Gly Asn Cys Tyr Pro Ala Ser Ser Cys 

275 280 285 

Pro Gly Ala Thr Gly Arg Gin Ser Glu Thr Tyr Leu He Gly Trp Tyr 

290 295 300 

Tyr Ala Trp Gly Gly Ser Ser Gin Gly Trp Ala Trp Arg He Gly Asp 
305 310 315 320 

Gly Ala Ala His Phe Gly Tyr Gin Asn Pro Leu Ala Ala Trp Ala Met 

325 330 335 

Ser Asn Val Thr Pro Leu He Pro Leu Ser Pro Thr Ala Lys Ser Asp 



340 345 350 



Trn 
1 ¥ 


Ala Ala 


Ser 


Leu 


Gin 


Arg 


Gin 


Leu 


Glu 


Phe 


Tvr 


Gin 


Trp 


Leu 


Gin 




355 










360 










365 








Ser 


Ala Glu 


Glv 


Ala 


He 


Ala 


Glv 
y 


Gly Ala 


Thr 


Asn 


Ser 


Trp 


Asn 


Gly 




370 








375 










380 










Asn 


Tvr* Cil v 


Thr 


Pro 


Pro 


Ala 


Glv 
y 


Asp 


Ser 


Thr 


Phe 


Tvr 
y 


Glv 


Met 


Ala 


385 








390 










395 










400 


Tvr 
x y x 


Asp Trp 


Glu 


Pro 


Val 


Tvr 
y 


His 


Asp 


Pro 


Pro 


Ser 


Asn 


Asn 


Trp 


Phe 








405. 










410 










415 




Gly 


Phe Gin 


Ala 


Tro 

J. x p> 


Ser 


Met 


Glu 


Arc* 


Val 


Ala 


Glu 


Tvr 
y 


Tvr 


Tvr 


Val 






420 










425 










430 






Thr 




Pro 


Lys 


Ala 


Lys 


Ala 


Leu 


Leu 


Asp 


Lys 


Tro 


Val 


Ala 


Trp 




435 










440 










445 








Val 


Lys Pro 


Asn 


Val 


Thr 


Thr 


Glv 


Ala 


Ser 


Tro 


Ser 


He 


Pro 


Ser 


Asn 




450 








455 










460 










Jjcu 




OCX 


Gly 


Gin 


Pro 


Asp 


Thr 


Trp 


Asn 


Pro 


Ser 


Asn 


Pro 


Glv 


46S 
\j j 








470 










475 










480 


TVi r* 


noil nlG 


Asn 


Leu 


His 


Val 


Thr 


He 


Thr 


Ser 


Ser 


Glv 


Gin 


Asp 


Val 








485 










490 










495 




vj-L y 


Val Ala 


Ala 


Ala 


Leu 


Ala 


Lys 


Thr 


Leu 


Glu 


Tvr 

x y x 


Tvr 
y 


Ala 


Ala 


Lys 






500 










505 










510 






OC1 




Thr 


Ala 


Ser 


Arg 


Asp 


Leu 


Ala 


Lys 


Gly 


Leu 


Leu 


Asp 


Ser 




515 










520 










525 








nc c 


± X £J 0 1 1 


Asn 


Asp 


Gin 


Asp 


Ser 


Leu 


Gly Val 


Ser 


Thr 


Pro 


Glu 


Thr 




530 








535 










540 


i 








rtx y 




Tvr" 


Ser 


Arg 


Phe 


Thr 


Gin 


Val 


Tyr 


Asp 


Pro 


Thr 


Thr 


Glv 

vjx_y 


J M J 








550 










555 










560 


nop 


d~\ v T ,£»n 
w x _y jjcu 


Tv*r 


He 


Pro 


Ser 


Gly 


Trp Thr Gly Thr 


Met 


Pro 


Asn 


Glv 
y 








565 










570 










575 




Asp 


Gin He 


Lys 


Pro 


Gly 


Ala 


Thr 


Phe 


Leu 


Ser 


He 


Arg 


Ser 


Trp 


Tyr 






580 










585 










590 






Thr 


Lys Asp 


Pro 


Gin 


Trp 


Ser 


Lys 


Val 


Gin 


Ala 


Tyr 


Leu 


Asn 


Gly 


Gly 




595 










600 










605 








Pro 


Ala Pro 


Thr 


Phe 


Asn 


Tyr 


His 


Arg 


Phe 


Trp 


Ala 


Glu 


Ser 


Asp 


Phe 




610 








615 










620 










Ala 


Met Ala 


Asn 


Ala 


Asp 


Phe 


Gly 


Met 


Leu 


Phe 


Pro 


Ser 


Gly 







625 630 635 



<210> 11 

<211> 2223 

<212> DNA 

<213> Acidothermus cellulolyticus 



<400> 11 

gcgacgactc 

gggatcgtct 

atgtatcgat 

aacaattggg 

gtatgggccg 

cgctcgtctg 

aacatgcccg 

ctgtatttcg 

tggtcccaga 

accggctatc 

tcgctcgggc 

ttctggagca 

atcccgcaca 



agccgtacac 
tcaatgaagg 
gggatgccgc 
ggtacaacgg 
ccgtcggaat 
atcagggcgc 
ggcgtggaat 
gcgccccgag 
tgacgaactt 
agagcgatat 
aagcgagtaa 
gagacggcgg 
agggcgtctt 



ctggagcaac 
tgcaccggga 
caacgggcgg 
cgtcgtcagc 
gtacaccaac 
aacgtggcaa 
gggcgagcgg 
cggcaaaggg 
tccggacgta 
tcaaggcgtc 
gaccattttt 
cgcgacgtgg 
tgacccggtc 



gtggcgatcg 
attctgtacg 
tggatccctc 
attgcggcag 
agctgggacc 
ataacgcccc 
cttgcggtgg 
ctctggagaa 
ggcacgtaca 
gtctgggtcg 
gtgggcgtgg 
caggcggtgc 
aaccacgtgc 



ggggcggcgg 

tgcggacgga 
ttctggattg 
acccgatcaa 
caaacgacgg 
tgccgttcaa 
atccaaacaa- 
gcacagattc 
ttgcaaatcc 
ctttcgacaa 
cggatcccaa 
cgggtgcgcc 
tctatattgc 



ctttgtcgac 
catcgggggg 
ggtgggatgg 
tactaacaag 
agcgattctc 
gcttggcggc 
tgacaacatt 
cggcgcgacc 
cactgacacg 
gtcttcgtca 
taatccggtc 
gaccggcttc 
caccagcaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



acgggtggtc cgtatgacgg gagctccggc gacgtctgga aattctcggt gacctccggg 84 0 

acatggacgc gaatcagccc ggtaccttcg acggacacgg ccaacgacta ctttggttac 900 

agcggcctca ctatcgaccg ccagcacccg aacacgataa tggtggcaac ccagatatcg 960 

tggtggccgg acaccataat ctttcggagc accgacggcg gtgcgacgtg gacgcggatc 1020 

tgggattgga cgagttatcc caatcgaagc ttgcgatatg tgcttgacat ttcggcggag 1080 

ccttggctga ccttcggcgt acagccgaat cctcccgtac cgagtccgaa gctcggctgg 1140 

atggatgaag cgatggcaat cgatccgttc aactctgatc ggatgctcta cggaacaggc 1200 

gcgacgttgt acgcaacaaa tgatctcacg aagtgggact ccggcggcca gattcatatc 1260 

gcgccgatgg tcaaaggatt ggaggagacg gcggtaaacg atctcatcag cccgccgtct 1320 

ggcgccccgc tcatcagcgc tctcggagac ctcggcggct tcacccacgc cgacgttact 13 80 

gccgtgccat cgacgatctt cacgtcaccg gtgttcacga ccggcaccag cgtcgactat 1440 

gcggaattga atccgtcgat catcgttcgc gctggaagtt tcgatccatc gagccaaccg 1500 

aacgacaggc acgtcgcgtt ctcgacagac ggcggcaaga actggttcca aggcagcgaa 1560 

cctggcgggg tgacgacggg cggcaccgtc gccgcatcgg ccgacggctc tcgtttcgtc 162 0 

tgggctcccg gcgatcccgg tcagcctgtg gtgtacgcag tcggatttgg caactcctgg 1680 

gctgcttcgc aaggtgttcc cgccaatgcc cagatccgct cagaccgggt gaatccaaag 1740 

actttctatg ccctatccaa tggaaccttc tatcgaagca cggacggcgg cgtgacattc 1800 

caaccggtcg cggccggtct tccgagcagc ggtgccgtcg gtgtcatgtt ccacgcggtg 1860 

cctggaaaag aaggcgatct gtggctcgct gcatcgagcg ggctttacca ctcaaccaat 1920 

ggcggcagca gttggtctgc aatcaccggc gtatcctccg cggtgaacgt gggatttggt 1980 

aagtctgcgc ccgggtcgtc atacccagcc gtctttgtcg tcggcacgat cggaggcgtt 2040 

acgggggcgt accgctccga cgacggtggg acgacctggg tacggatcaa tgatgaccag 2100 

caccaatacg gaaattgggg acaagcaatc accggtgacc cgcgaattta cgggcgggtg 2160 

tacataggca cgaacggccg tggaattgtc tacggggaca ttggtggtgc gccgtccgga 2220 

•teg . i 2223 

<210> 12 
<211> 741 
<212> PRT 

<213> Acidothermus cellulolyticus 



<400> 


12 


























Ala 


Thr 


Thr 


Gin 


Pro 


Tyr 


Thr 


Trp 


Ser 


Asn 


Val 


Ala 


He Gly Gly Gly 


1 








5 










10 










15 


Gly 


Phe 


Val 


Asp 


Gly 


He 


Val 


Phe 


Asn 


Glu Gly 


Ala 


Pro Gly 


He Leu 








20 










25 










30 




Tyr 


Val 


Arg 


Thr 


Asp 


He 


Gly 


Gly 


Met 


Tyr 


Arg 


Trp 


Asp 


Ala 


Ala Asn 






35 










40 










45 






Gly 


Arg 


Trp 


He 


Pro 


Leu 


Leu 


Asp 


Trp 


Val 


Gly 


Trp 


Asn 


Asn 


Trp Gly 




50 










55 










60 








Tyr 


Asn 


Gly 


Val 


Val 


Ser 


He 


Ala 


Ala 


Asp 


Pro 


He 


Asn 


Thr 


Asn Lys 


65 










70 










75 








80 


Val 


Trp 


Ala 


Ala 


Val 


Gly 


Met 


Tyr 


Thr 


Asn 


Ser 


Trp 


Asp 


Pro 


Asn Asp 










85 










90 










95 


Gly 


Ala 


He 


Leu 


Arg 


Ser 


Ser 


Asp 


Gin 


Gly Ala 


Thr 


Trp 


Gin 


He Thr 








100 










105 










110 




Pro 


Leu 


Pro 


Phe 


Lys 


Leu 


Gly 


Gly 


Asn 


Met 


Pro 


Gly Arg Gly Met Gly 






115 










120 










125 






Glu 


Arg 


Leu 


Ala 


Val 


Asp 


Pro 


Asn 


Asn 


Asp 


Asn 


He 


Leu 


Tyr 


Phe Gly 




130 










135 










140 








Ala 


Pro 


Ser 


Gly 


Lys 


Gly 


Leu 


Trp 


Arg 


Ser 


Thr 


Asp 


Ser Gly Ala Thr 


145 










150 










155 








160 


Trp 


Ser 


Gin 


Met 


Thr 


Asn 


Phe 


Pro 


Asp 


Val 


Gly 


Thr 


Tyr 


He 


Ala Asn 










165 










170 










175 


Pro 


Thr 


Asp 


Thr 


Thr 


Gly 


Tyr 


Gin 


Ser 


Asp 


He 


Gin 


Gly Val 


Val Trp 








180 










185 










190 




Val 


Ala 


Phe 


Asp 


Lys 


Ser 


Ser 


Ser 


Ser 


Leu 


Gly 


Gin 


Ala 


Ser 


Lys Thr 



195 

He Phe Val Gly 
210 

Asp Gly Gly Ala 
225 

He Pro His Lys 

Ala Thr Ser Asn 
260 

Trp Lys Phe Ser 
275 

Pro Ser Thr Asp 
290 

He Asp Arg Gin 
305 

Trp Trp Pro Asp 

Trp Thr Arg He 
340 

Tyr Val Leu Asp 
355 

Pro Asn Pro Pro 
370 

Met Ala He Asp 
385 

Ala Thr Leu Tyr 

Gin He His He 
420 

Asn Asp Leu He 
435 

Gly Asp Leu Gly 
450 

Thr He Phe Thr 
465 

Ala Glu Leu Asn 

Ser Ser Gin Pro 
500 

Lys Asn Trp Phe 
515 

Thr Val Ala Ala 
530 

Asp Pro Gly Gin 
545 

Ala Ala Ser Gin 

Val Asn Pro Lys 
580 

Ser Thr Asp Gly 
595 

Ser Ser Gly Ala 
610 

Gly Asp Leu Trp 
625 

Gly Gly Ser Ser 



200 

Val Ala Asp Pro 
215 

Thr Trp Gin Ala 
230 

Gly Val Phe Asp 
245 

Thr Gly Gly Pro 

Val Thr Ser Gly 
280 

Thr Ala Asn Asp 
295 

His Pro Asn Thr 
310 

Thr He He Phe 
325 

Trp Asp Trp Thr 

He Ser Ala Glu 
360 

Val Pro Ser Pro 
375 

Pro Phe Asn Ser 
390 

Ala Thr Asn Asp 
405 

Ala Pro Met Val 

Ser Pro Pro Ser 
440 

Gly Phe Thr His 
455 

Ser Pro Val Phe 
470 

Pro Ser He He 
485 

Asn Asp Arg His 

Gin Gly Ser Glu 
520 

Ser Ala Asp Gly 
535 

Pro Val Val Tyr 
550 

Gly Val Pro Ala 
565 

Thr Phe Tyr Ala 

Gly Val Thr Phe 
600 

Val Gly Val Met 
615 

Leu Ala Ala Ser 
630 

Trp Ser Ala He 
645 



Asn 


Asn 


Pro 


Val 








220 


Val 


Pro 


Gly 


Ala 






235 




Pro 


Val 


Asn 


His 




250 






Tyr 


Asp 


Gly 


Ser 


265 








Thr 


Trp 


Thr 


Arg 


Tyr 


Phe 


Gly 


Tyr 








300 


He 


Met 


Val 


Ala 






315 




Arg 


Ser 


Thr 


Asp 




330 






Ser 


Tyr 


Pro 


Asn 


345 








Pro 


Trp 


Leu 


Thr 


Lys 


Leu 


Gly 


Trp 








380 


Asp 


Arg 


Met 


Leu 






395 




Leu 


Thr 


Lys 


Trp 




410 






Lys 


Gly 


Leu 


Glu 


425 








Gly 


Ala 


Pro 


Leu 


Ala 


Asp 


Val 


Thr 








460 


Thr 


Thr 


Gly 


Thr 






475 




Val 


Arg 


Ala 


Gly 




490 






Val 


Ala 


Phe 


Ser 


505 








Pro 


Gly 


Gly 


Val 


Ser 


Arg 


Phe 


Val 








540 


Ala 


Val 


Gly 


Phe 






555 




Asn 


Ala 


Gin 


He 




570 






Leu 


OCX 


Asn 












bin 


Pro 


vai 


Ala 


Phe 


His 


Ala 


Val 








620 


Ser 


Gly 


Leu 


Tyr 






635 




Thr 


Gly 


Val 


Ser 




650 







205 

Phe Trp Ser Arg 

Pro Thr Gly Phe 
240 

Val Leu Tyr He 
255 

Ser Gly Asp Val 
270 

He Ser Pro Val 
285 

Ser Gly Leu Thr 

Thr Gin He Ser 
320 

Gly Gly Ala Thr 
335 

Arg Ser Leu Arg 
350 

Phe Gly Val Gin 
365 

Met Asp Glu Ala 

Tyr Gly Thr Gly 
400 

Asp Ser Gly Gly 
415 

Glu Thr Ala Val 
430 

He Ser Ala Leu 
445 

Ala Val Pro Ser 

Ser Val Asp Tyr 
480 

Ser Phe Asp Pro 
495 

Thr Asp Gly Gly 
510 

Thr Thr Gly Gly 
525 

Trp Ala Pro Gly 

Gly Asn Ser Trp 
560 

Arg Ser Asp Arg 
575 

Thr Phe Tyr Arg 
590 

Ala Gly Leu Pro 
605 

Pro Gly Lys Glu 

His Ser Thr Asn 
640 

Ser Ala Val Asn 
655 



Val 


Gly Phe Gly 


Lys 


Ser Ala 


Pro 


Gly 


Ser 


Ser 


Tyr Pro 


Ala Val Phe 




660 








665 








670 


Val 


Val Gly Thr 


He 


Gly Gly 


Val 


Thr Gly Ala 


Tyr Arg 


Ser Asp Asp 




675 






680 








685 




Gly Gly Thr Thr 


Trp 


Val Arg 


He 


Asn 


Asp 


Asp 


Gin His 


Gin Tyr Gly 




690 




695 










700 




Asn 


Trp Gly Gin 


Ala 


He Thr 


Gly Asp 


Pro 


Arg 


He Tyr Gly Arg Val 


705 






710 








715 




720 


Tyr 


lie Gly Thr 


Asn 


Gly Arg 


Gly 


He 


Val 


Tyr Gly Asp 


He Gly Gly 






725 








730 






735 


Ala 


Pro Ser Gly 


Ser 

















740 



<210> 13 

<211> 1677 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



construct based on Thermobif ida fusca 



<400> 13 

gccggctgct 

accatcacca 

aaccagcagg 

gtcagcaacg 

aacggctcct 

tgcgacggct 

cccacagacc 

gtcgacaacc 

gccgctgagc 

atcggcgcca 

ctggaggagg 

aacctgcccg 

ctcgaccgct 

gactacgaga 

accaacgtgg 

tacgtcaacg 

tacatcgacg 

gacatcttct 

atctccaaca 

gttaacggcc 

ctctccttcg 

ggtatgctca 

agctcctcca 

cccggtaact 

gctcccggtg 

gaggagatcc 

ggcaacgccc 

cactggttct 



cggtggacta 
acctcggcag 
tgaccaacct 
ccccgtacaa 
actcgggcag 
cggacgaccc 
cggatgagcc 
cgttcgaggg 
cgggcggttc 
tcgagggcaa 
ccgtccgcca 
gccgcgactg 
acaagagcga 
acctgcggat 
gcgggaacgg 
gtgtcggcta 
ccgcccacca 
acgaggccgc 
cggccaacta 
agctcatccg 
tccaggacct 
tcgacacctc 
ccgacctcaa 
ggtgcaacca 
ttgacgccta 
cgaacgacga 
gcaacggcaa 
ctgcccagtt 



cacggtcaac 
tgcgatcaac 
gtggaacggg 
cgcctccatc 
caacgacatc 
cgaccccgag 
gggcggcccg 
cgccaagctg 
cgcggtcgcc 
cgacagcccg 
gtccggtggc 
cgccgcgctg 
gtacatcgac 
cgtcgccatc 
cggcaccgag 
cgccctccgc 
cggctggatc 
caacgcctcc 
ctcggccact 
ccagtccaag 
gcgtcaggcc 
ccgcaacggc 
cacctacgtt 
ggccggtgcg 
cgtctgggtg 
gggcaagggc 
caacccctcg 
ccgcgagctg 



tcctggggta 
ggctggaccc 
acctacaccc 
ccggccaacg 
ccctcctcct 
cccagcccct 
accaacccgc 
tacgtgaacc 
aacgagtcca 
accaccggct 
gacccgctga 
gcctccaacg 
ccgatcgccg 
atcgagatcg 
ctctgcgcct 
aagctgggcg 
ggctgggact 
ggctccaccg 
gtggagccgt 
tgggttgact 
ctgatcgcca 
tggggtggcc 
gacgagagcc 
ggcctcggcg 
aagcccccgg 
ttcgaccgca 
ggtgcgctgc 
ctggccaacg 



ccgggttcac 
tggagtggga 
agtccgggca 
gaacggttga 
tcaagctgaa 
cccccagccc 
ccaccaaccc 
cggtctggtc 
ccgctgtctg 
ccatgggtct 
ccatccaggt 
gtgagctggg 
acatcatgtg 
actccctgcc 
acatgaagca 
agatcccgaa 
ccaacttcgg 
tggactacgt 
acctggacgt 
ggaaccagta 
agggcttccg 
cgaaccgtcc 
gtatcgaccg 
agcggcccac 
gtgagtccga 
tgtgcgaccc 
ccaacgcccc 
cctacccgcc 



cgccaacgtc 
cttccccggc 
gcacgtgtcg 
gttcgggttc 
cggggttacc 
ttcccccagc 
cggcgagaag 
ggccaaggcc 
gctggaccgt 
gcgcgaccac 
cgtcatctac 
tcccgatgaa 
ggacttcgca 
caacctcgtc 
gaacggcggc 
cgtctacaac 
cccctcggtg 
gcacggcttc 
caacggcacc 
cgtcgacgag 
gtccgacatc 
gaccggaccg 
ccgtatccac 
ggtcaacccg 
cggcgccagc 
gacctaccag 
catctccggc 
tctgtaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320' 
1380 
1440 
1500 
1560 
1620 
1677 



<210> 14 

<211> 558 

<212> PRT 

<213> Artificial Sequence 



<220> 



<223> construct based on Thermobif ida fusca 



<400> 14 



Ala 


Gly 


Cys 


Ser 


Val 


Asp 


Tyr 


Thr 


Val 


Asn 


Ser 


Trp 


Gly 


Thr 


Gly 


Phe 


1 








5 










10 










15 




Thr 


Ala 


Asn 


Val 


Thr 


He 


Thr 


Asn 


Leu 


Gly 


Ser 


Ala 


He 


Asn 


Gly 


Trp 








20 










25 










30 






Thr 


Leu 


Glu 


Trp 


Asp 


Phe 


Pro 


Gly 


Asn 


Gin 


Gin 


Val 


Thr 


Asn 


Leu 


Trp 






35 










40 










45 








Asn 


Gly 


Thr 


Tyr 


Thr 


Gin 


Ser 


Gly 


Gin 


His 


Val 


Ser 


Val 


Ser 


Asn 


Ala 




50 










55 










60 










Pro 


Tyr 


Asn 


Ala 


Ser 


He 


Pro 


-Ala 


Asn 


Gly 


Thr 


Val 


Glu 


Phe 


Gly 


Phe 


65 










70 










75 










80 


Asn 


Gly 


Ser 


Tyr 


Ser 


Gly 


Ser 


Asn 


Asp 


He 


Pro 


Ser 


Ser 


Phe 


Lys 


Leu 










85 










90 










95 




Asn 


Gly 


Val 


Thr 


Cys 


Asp 


Gly 


Ser 


Asp 


Asp 


Pro 


Asp 


Pro 


Glu 


Pro 


Ser 








100 










105 










110 






Pro 


Ser 


Pro 


Ser 


Pro 


Ser 


Pro 


Ser 


Pro 


Thr 


Asp 


Pro 


Asp 


Glu 


Pro 


Gly 






115 










120 










125 








Gly 


Pro 


Thr 


Asn 


Pro 


Pro 


Thr 


Asn 


Pro 


Gly 


Glu 


Lys 


Val 


Asp 


Asn 


Pro 




130 










135 










140 










Phe 


Glu 


Gly 


Ala 


Lys 


Leu 


Tyr 


Val 


Asn 


Pro 


Val 


Trp 


Ser 


Ala 


Lys 


Ala 


145 










150 










155 










160 


Ala 


Ala 


Glu 


Pro 


Gly 


Gly 


Ser 


Ala 


Val 


Ala 


Asn 


Glu 


Ser 


Thr 


Ala 


Val 










165 










170 










175 




Trp 


Leu 


Asp 


Arg 


He 


Gly 


Ala 


He 


Glu 


Gly 


Asn 


Asp 


Ser 


Pro 


Thr 


Thr 








180 










185 










190 






Gly 


Ser 


Met 


Gly 


Leu 


Arg 


Asp 


His 


Leu 


Glu 


Glu 


Ala 


Val 


Arg 


Gin 


Ser 






195 










200 










205 








Gly 


Gly 


Asp 


Pro 


Leu 


Thr 


He 


Gin 


Val 


Val 


He 


Tyr 


Asn 


Leu 


Pro 


Gly 




210 










215 










220 










Arg 


Asp 


Cys 


Ala 


Ala 


Leu 


Ala 


Ser 


Asn 


Gly 


Glu 


Leu 


Gly 


Pro 


Asp 


Glu 


225 










230 










235 










240 


Leu 


Asp 


Arg 


Tyr 


Lys 


Ser 


Glu 


Tyr 


He 


Asp 


Pro 


He 


Ala 


Asp 


He 


Met 










245 










250 










255 




Trp 


Asp 


Phe 


Ala 


Asp 


Tyr 


Glu 


Asn 


Leu 


Arg 


He 


Val 


Ala 


He 


He 


Glu 








260 










265 










270 






He 


Asp 


Ser 


Leu 


Pro 


Asn 


Leu 


Val 


Thr 


Asn 


Val 


Gly 


Gly 


Asn 


Gly 


Gly 






275 










280 










285 








Thr 


Glu 


Leu 


Cys 


Ala 


Tyr 


Met 


Lys 


Gin 


Asn 


Gly 


Gly 


Tyr 


Val 


Asn 


Gly 




290 










295 










300 










Val 


Gly 


Tyr 


Ala 


Leu 


Arg 


Lys 


Leu 


Gly 


Glu 


He 


Pro 


Asn 


Val 


Tyr 


Asn 


305 










310 










315 










320 


Tyr 


He 


Asp 


Ala 


Ala 


His 


His 


Gly 


Trp 


He 


Gly 


Trp 


Asp 


Ser 


Asn 


Phe 










325 










330 










335 




Gly 


Pro 


Ser 


Val 


Asp 


He 


Phe 


Tyr 


Glu 


Ala 


Ala 


Asn 


Ala 


Ser 


Gly 


Ser 








340 










345 










350 






J. i IX 






T\/T" 


Val 


His 


Gly 


Phe 


He 


Ser 


Asn 


Thr 


Ala 


Asn 


Tvr 


Ser 






355 










360 










365 








Ala 


Thr 


Val 


Glu 


Pro 


Tyr 


Leu 


Asp 


Val 


Asn 


Gly 


Thr 


Val 


Asn 


Gly 


Gin 




370 










375 










380 










Leu 


He 


Arg 


Gin 


Ser 


Lys 


Trp 


val 


Asp 


Trp 


Asn 


Gin 


Tyr 


Val 


Asp 


Glu 


385 










390 










395 










400 


Leu 


Ser 


Phe 


Val 


Gin 


Asp 


Leu 


Arg 


Gin 


Ala 


Leu 


He 


Ala 


Lys 


Gly 


Phe 










405 










410 










415 




Arg 


Ser 


Asp 


He 


Gly 


Met 


Leu 


He 


Asp 


Thr 


Ser 


Arg 


Asn 


Gly 


Trp 


Gly 








420 










425 










430 







Gly Pro Asn Arg 
435 

Tyr Val Asp Glu 
450 

Cys Asn Gin Ala 
465 

Ala Pro Gly Val 

Asp Gly Ala Ser 
500 

Arg Met Cys Asp 
515 

Pro Ser Gly Ala 
530 

Ala Gin Phe Arg 

545 • 



Pro 


Thr 


Gly 


Pro 








440 


Ser 


Arg 


He 


Asp 






455 




Gly 


Ala 


Gly 


Leu 




470 






Asp 


Ala 


Tvr 


Val 


485 








Glu 


Glu 


He 


Pro 


Pro 


Thr 


Tyr 


Gin 








520 


Leu 


Pro 


Asn 


Ala 






535 




Glu 


Leu 


Leu 


Ala 




550 







Ser 


Ser 


Ser 


Thr 


Arg 


Arg 


He 


His 








460 


Gly 


Glu 


Arg 


Pro 






475 




Trp 


Val 


Lys 


Pro 




490 






Asn 


Asp 


Glu 


Gly 


505 








Gly 


Asn 


Ala 


Arg 


Pro 


He 


Ser 


Gly 








540 


Asn 


Ala 


Tyr 


Pro 






555 





Asp Leu Asn Thr 
445 

Pro Gly Asn Trp 

Thr Val Asn Pro 
480 

Pro Gly Glu Ser 
495 

Lys Gly Phe Asp 
510 

Asn Gly Asn Asn 
525 

His Trp Phe Ser 
Pro Leu 



<210> 15 

<211> 1293 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> construct based on Thermobifida fusca 



<400> 15 

gccggtctca 

gtcaccgtcc 

cccggcggca 

cacaccttca 

ggcttcatcg 

gacgaaggct 

acgcagcccg 

cagctctgcg 

cagtggttcg 

gccgacatca 

ggcttcaccg 

atcgtggact 

accttcttcg 

gccaacgaac 

ccggtgatcc 

tcgctcggcg 

tccaacatca 

gcgctgcgtg 

tacaccggtg 

gaacggaaga 

ttccagccgg 

ggacagtggg 



ccgccacagt 
gcaacgacac 
ctacagtggc 
ccggggtttc 
cttccggcag 
ccgagccggg 
gcaccggcac 
acgagcacgg 
accactgcct 
tccgcctgtc 
accggatgca 
ggcacatcct 
cggaaatcgc 
ccaacggagt 
gccagcgcga 
tctccgaagg 
tgtacgcctt 
aggcctccga 
acggcgccaa 
tcgggtggac 
gcacctgcgc 
tgcggagcaa 



caccaaagaa 
ctcgagcacc 
ccaggtgtgg 
ctggaacagc 
cggcgaaccc 
cggccccggc 
cccggtcgag 
caacccggtc 
gaccgacagc 
catgtacatc 
ccagctcatc 
caccccgggc 
ccagcgccac 
gagctgggcc 
ccccgactcg 
ctccggcccc 
ccacttctac 
gctgttcccg 
cgacttccag 
caagtggaac 
gtccggcggc 
gctccagtcc 



tcctcgtggg 
gtctcccagt 
aacgcccagc 
accatcccgc 
acccactgca 
ggtcccggaa 
cggtacggca 
caactgcgcg 
tcgctggacg 
caggaagacg 
gacatggcca 
gatccccact 
gccagcaaga 
tccatcaaga 
gtgatcatcg 
gccgagatcg 
gcggcctcgc 
gtcttcgtca 
atggccgacc 
tactcggacg 
ccgtggagcg 
tga 



acaacggcta 
gggaggtcgt 
acaccagcag 
ccggaggcac 
ccatcaacgg 
ccccctcccc 
aagtccaggt 
gcatgagcac 
ccctggccta 
gctacgagac 
cggcgcgcgg 
acaacctgga 
ccaacgtgct 
gctacgccga 
tgggcacccg 
cggccaaccc 
accgcgacaa 
ccgagttcgg 
gctacatcga 
acttccgttc 
gttcgtcgct 



ctccgcgtcc 
cctcaccctg 
cggcaactcc 
cgcctcttcc 
cgccccctgc 
cgaccccggc 
ctgcggcacc 
ccacggcatc 
cgactggaag 
caacccgcgc 
cctgtacgtg 
ccgggccaag 
ctacgagatc 
agaggtcatc 
cggctggtcg 
ggtcaacgcc 
ctacctcaac 
caccgagacc 
cctgatggcg 
cggcgcggtc 
gaaggcgtcc 



60 
120 
180 
2^0 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1293 



<210> 16 

<211> 430 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> construct based on Thermobifida fusca 



<400> 16 



Al a 

n 1 d 


Gly Leu 


Thr 


Ala 


Thr 


Val 


Thr 


Lys 


Glu 


Ser 


Ser 


Tro 

x 1 p 


Asp 


Asn 


Glv 

oxy 


1 








5 










10 










15 




T\7r 

i yx 


Ser 


Ala 


OCX 


Va 1 

vdl 


Th r 

X 111 


Va 1 

V CI 1 


Arg 


Asn 


Asp 


Thr 


Ser 


Ser 


Thr 


Val 


Ser 








20 










25 










30 






pi n 


Tro 


Glu 


Val 


Val 

V G 1 


T ,pi i 


Thr 


Leu 


Pro 


0 1 y 


Gly 


Thr 


Thr 


Val 


Ala 


Gin 






35 










40 










45 








Val 


j. x p 


Asn 


Ala 


Gin 


His 


Thr 


Ser 


Ser 


Gly 


Asn 


Ser 


His 


Thr 


Phe 


Thr 




50 










55 










60 










P"| \/ 

oxy 


Val 


Ser 


Tro 
lip 


noil 


OCX 


Thr 
1 in 


Tip 

X X c 


Pro 


Pro 

IT 1 O 


P,l V 

vj 1 y 


Glv 

0 iy 


Thr 


Ala 


Qpr 

O Cl 


Qpr 

O Cl 


65 










70 










75 










80 


pi V 


Phe 


lie 


Ala 


Ser 


Gl V 
01 y 


Ser 


Gly 


Glu 


Pro 


Thr 


His 


Cvs 

oy 0 


Thr 


lie 


Asn 










85 










90 










95 






Ala 


Pro 


v_y 0 


Asp 


Glu 


Gly 


Ser 


Glu 


Pro 


Gly 


Glv 

oxy 


Pro 


Gly 


Glv 

oxy 


Pro 








100 










105 










110 






Pi v 

oxy 


Thr 


Pro 


C p y- 
OCX 


i 1 


A CO 
nop 


Pro 


Pi V 
oxy 


Thr 


Gin 


Pro 


Glv 

oxy 


Thr 


Glv 

oxy 


Thr 


Pro 






115 










120 










125 








val 

V a 1 


Glu 


Arg 


1 yx 


pi v 

oiy 


T A7C 
xjy 0 


Va 1 
v a. 1 


Gin 


Val 


oy & 


Pi v 

oxy 


Thr 


Gin 


Leu 


Pvc 
v_y 0 


Asp 




13 0 










135 










140 










pi n 

O 1 U 


His 


Gly 


noil 


i 1 vj 


Va 1 

val 


pi n 

OX11 


T .pi i 
JjCU 


A r 0 
ni y 


fil V 
oxy 




0 cx 


Thr 


His 


HI V 

oxy 


Tie 

X X c 


1 4 R 
j. j 










150 










155 










160 


O XI 1 


Trn 
U P 


Phe 




Hi c 
nib 


P\7C 

v^yo 


T .pi 1 


Thr 

1 1J.X 


A cn 
nop 


OCX 


OC1 


T 1P11 


A cn 
nop 


Ala 


ucu 


Ala 










1 u J 










170 

X / V 










175 




iyr 


A Qn 
nop 


Tro 


T A/C 

xjy 0 


nla 


A cn 
nop 


Tl p 

X X c 


Tip 

X X c 


A r rr 
ni y 


T , pi i 


O Cl 


MPt 

1*1 C L. 


Tvr 
lyx 


Tip 

X X c 


PI n 

OX 11 


pi 11 








IOU 










1 

10 j 










1 QD 

X «7 VJ 






no|J 


Pi V 


Tvr 
1 y x 


ui u 


1 111. 


A 0 ti 
noil 


Pro 
i 1 \j 


A r rr 
nx y 


PI V 

oxy 


Php 


Thr 
1 111 


A co 
nop 


A r 0* 
nx y 


Mpf 

11C L. 


Hi c 
oxo 


PI n 

OXil 






1 9S 

X J -J 




















20S 
^» \j ~j 








T .01 1 
XjcU 


lie 


Asp 


Mpr 

1\IC L. 


AT a 

nla 


Th y 
1 111 


Al a 
nla 


A r rr 
nx y 


Pi V 

oxy 


T .pi 1 
JjCU 


T\/r 
lyx 


Va 1 
v a x 


Tip 

X X c 


Va 1 
v a x 


A co 
nop 


Tro 
1 xp 




210 










215 










220 










Hi C 
nib 


lie 


Leu 


Thr 


Pro 


Pi V 
0 x y 


Asp 


Pro 


His 


1 yx 


Asn 


Leu 


ASp 


Arg 


Ala 


Lys 


225 










230 










235 










240 


Th V" 

1 1 IX 


Phe 


Phe 


Ala 


Glu 


lie 


Ala 


Gin 


Arg 


His 


Ala 


Ser 


Lys 


Thr 


Asn 


Val 




















250 










255 




Lj c u 


Tvr 

j. y x 


Glu 


Tip 

lie 


Al ^ 


A cn 

noli 


c;l ii 

Old 


Pro 
" 1 0 


Asn 


PI V 

oxy 


Val 

v a x 


OCX 


Tro 
1 x p 


Ala 


Qpr 

OCX 


Tip 

X X c 








260 










265 










270 






T A/C 


Ser 


Tvr 
j. y x 


Al a 

nla 


pi 11 

oxu 


pi 11 

ox U 


Va 1 


T1p 

X X c 


Pro 
it 1 0 


Va 1 
val 


Tip 

X X c 


Aro 
ni y 


PI n 
0x11 


Aro 
ni y 


A co 
nop 


Pro 

IrX O 






27 S 










280 
t> 0 \j 










285 








ASp 


Ser 


Val 


Tip 
lie 


Tip 
X X c 


Va 1 

vcix 


oxy 


Th r 
1 111 


A y ci 
nx y 


PI \7 

oxy 


Tro 
lip 


Qpr 
OCX 


Qpr 

OCX 


T .pi 1 


Pi \7 

oxy 


Val 
v a x 




290 










<L J ~) 










300 










OcX 


Glu 


Gly 


OCX 


pi v 

oiy 


JrX O 


Al a 

nla 


Pi 11 

oxu. 


Tip 

X X c 


Al a 
nx a 


Al a 
n x a 


A cn 
nbii 


Pro 

JrX 0 


Val 
v a x 


A cn 
noil 


Al a 
nx a 


305 










310 










315 










320 


Co v- 


Asn 


lie 


1*1 c L 


Twr 

iyr 


Al a 

nla 


pVip 
rile 


Hi c 
n 1 0 


Php 
iri 1C 


iyr 


Al a 
nx a 


Al a 
nx a 


OCX 


Hi c 
n x 0 


A ro 
nX y 


A co 
nop 










325 










330 










335 




Asn 


T\7r 


Leu 


Asn 


Al a 
nX a 


T .01 i 


A vo 

nx y 


Pi n 
OXU 


Ala 

nla 


Qpr 

OCX 


PI 11 
ox u 


T ,01 1 

JjCU 


Php 
.rile 


Pro 
JrX KJ 


Va 1 
v a 1 


Php 


















"34 R 










1 RO 






vdl 


Thr 

1 1 IX 




Ph 0 

Jrne 


bly 


Tnr 


oxu 


1 nx 


iyr 


Thr 
1 Ilx 


Pi \7 

oxy 


Asp 


Pi \7 

oxy 


Ala 
n.X a 


Asn 


Asp 






i bb 










7 fin 










J D 








Phe 


Gin 


Met 


71 T _ 
Ala 


Asp 


Arg 


Tyr 


t1 . 

lie 


Asp 


Leu 


Met 


Ala 


Glu 


Arg 


Lys 


lie 




370 










375 










380 










Gly 


Trp 


Thr 


Lys 


Trp 


Asn 


Tyr 


Ser 


Asp 


Asp 


Phe 


Arg 


Ser 


Gly 


Ala 


Val 


385 










390 










395 










400 


Phe 


Gin 


Pro 


Gly 


Thr 


Cys 


Ala 


Ser 


Gly 


Gly 


Pro 


Trp 


Ser 


Gly 


Ser 


Ser 










405 










410 










415 




Leu 


Lys 


Ala 


Ser 


Gly 


Gin 


Trp 


Val 


Arg 


Ser 


Lys 


Leu 


Gin 


Ser 












420 










425 










430 







<210> 17 



<211> 2656 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> fusion construct 
<400> 17 

atgtatcgga agttggccgt catctcggcc ttcttggcca cagctcgtgc tcagtcggcc 60 

tgcactctcc aatcggagac tcacccgcct ctgacatggc agaaatgctc gtctggtggc 12 0 

acttgcactc aacagacagg ctccgtggtc atcgacgcca actggcgctg gactcacgct 180 

acgaacagca gcacgaactg ctacgatggc aacacttgga gctcgaccct atgtcctgac 240 

aacgagacct gcgcgaagaa ctgctgtctg gacggtgccg cctacgcgtc cacgtacgga 3 00 

gttaccacga gcggtaacag cctctccatt ggctttgtca cccagtctgc gcagaagaac 360 

gttggcgctc gcctttacct tatggcgagc gacacgacct accaggaatt caccctgctt 420 

ggcaacgagt tctctttcga tgttgatgtt tcgcagctgc cgtaagtgac ttaccatgaa 4 80 

cccctgacgt atcttcttgt gggctcccag ctgactggcc aatttaaggt gcggcttgaa 540 

cggagctctc tacttcgtgt ccatggacgc ggatggtggc gtgagcaagt atcccaccaa 600 

caccgctggc gccaagtacg gcacggggta ctgtgacagc cagtgtcccc gcgatctgaa 660 

gttcatcaat ggccaggcca acgttgaggg ctgggagccg tcatccaaca acgcaaacac 720 

•gggcattgga ggacacggaa gctgctgctc tgagatggat atctgggagg ccaactccat 7 80 

ctccgaggct cttacccccc acccttgcac gactgtcggc caggagatct gcgagggtga 840 

tgggtgcggc ggaacttact ccgataacag atatggcggc acttgcgatc ccgatggctg 900 

cgactggaac ccataccgcc tgggcaacac cagcttctac ggccctggct caagctttac 960 

• cctcgatacc accaagaaat tgaccgttgt cacccagttc gagacgtcgg gtgccatcaa 1020 

ccgatactat gtccagaatg gcgtcacttt ccagcagccc aacgccgagc ttggtagtta 1080 

ctctggcaac gagctcaacg atgattactg cacagctgag gaggcagaat tcggcggatc 114 0 

ctctttctca gacaagggcg gcctgactca gttcaagaag gctacctctg gcggcatggt 1200 

tctggtcatg agtctgtggg atgatgtgag tttgatggac aaacatgcgc gttgacaaag 1260 

agtcaagcag ctgactgaga tgttacagta ctacgccaac atgctgtggc tggactccac 1320 

ctacccgaca aacgagacct cctccacacc cggtgccgtg cgcggaagct gctccaccag 1380 

ctccggtgtc cctgctcagg tcgaatctca gtctcccaac gccaaggtca ccttctccaa 1440 

catcaagttc ggacccattg gcagcaccgg caaccctagc ggcggcaacc ctcccggcgg 1500 

aaacccgcct ggcaccacca ccacccgccg cccagccact accactggaa gctctcccgg 1560 

acctactagt aagcgggcgg gcggcggcta ttggcacacg agcggccggg agatcctgga 1620 

cgcgaacaac gtgccggtac ggatcgccgg catcaactgg tttgggttcg aaacctgcaa ' 1680 

ttacgtcgtg cacggtctct ggtcacgcga ctaccgcagc atgctcgacc agataaagtc 1740 

gctcggctac aacacaatcc ggctgccgta ctctgacgac attctcaagc cgggcaccat 1800 

gccgaacagc atcaattttt accagatgaa tcaggacctg cagggtctga cgtccttgca 1860 

ggtcatggac aaaatcgtcg cgtacgccgg tcagatcggc ctgcgcatca ttcttgaccg 1920 

ccaccgaccg gattgcagcg ggcagtcggc gctgtggtac acgagcagcg tctcggaggc 1980 

tacgtggatt tccgacctgc aagcgctggc gcagcgctac aagggaaacc cgacggtcgt 2040 

cggctttgac ttgcacaacg agccgcatga cccggcctgc tggggctgcg gcgatccgag 2100 

catcgactgg cgattggccg ccgagcgggc cggaaacgcc gtgctctcgg tgaatccgaa 2160 

cctgctcatt ttcgtcgaag gtgtgcagag ctacaacgga gactcctact ggtggggcgg 2220 

caacctgcaa ggagccggcc agtacccggt cgtgctgaac gtgccgaacc gcctggtgta 2280 

ctcggcgcac gactacgcga cgagcgtcta cccgcagacg tggttcagcg atccgacctt 2340 

ccccaacaac atgcccggca tctggaacaa gaactgggga tacctcttca atcagaacat 2400 

tgcaccggta tggctgggcg aattcggtac gacactgcaa tccacgaccg accagacgtg 2460 

gctgaagacg ctcgtccagt acctacggcc gaccgcgcaa tacggtgcgg acagcttcca 2520 

gtggaccttc tggtcctgga accccgattc cggcgacaca ggaggaattc tcaaggatga 2580 

ctggcagacg gtcgacacag taaaagacgg ctatctcgcg ccgatcaagt cgtcgatttt 2640 

cgatcctgtc ggctaa 2656 

<210> 18 
<211> 841 
<212> PRT 



<213> Artificial Sequence 
<220> 

<223> fusion construct 



<400> 18 



Met 


Tyr 


Arg 


Lys 


Leu 


Ala 


Val 


He 


Ser 


Ala 


Phe 


Leu 


Ala 


Thr 


Ala 


Arg 


1 








5 










10 










15 




Ala 


Gin 


Ser 


Ala 


Cys 


Thr 


Leu 


Gin 


Ser 


Glu 


Thr 


His 


Pro 


Pro 


Leu 


Thr 








20 










25 










30 






Trp 


Gin 


Lys 


Cys 


Ser 


Ser 


Gly 


Gly 


Thr 


Cys 


Thr 


Gin 


Gin 


Thr 


Gly 


Ser 






35 










40 










45 








Val 


Val 


He 


Asp 


Ala 


Asn 


Trp 


Arg 


Trp 


Thr 


His 


Ala 


Thr 


Asn 


Ser 


Ser 




50 










55 










60 










Thr 


Asn 


Cys 


Tyr 


Asp 


Gly 


Asn 


Thr 


Trp 


Ser 


Ser 


Thr 


Leu 


Cys 


Pro 


Asp 


65 










70 










75 










80 


Asn 


Glu 


Thr 


Cys 


Ala 


Lys 


Asn 


,Cys 


Cys 


Leu 


Asp 


Gly 


Ala 


Ala 


Tyr 


Ala 










85 










90 










95 




Ser 


Thr 


Tyr 


Gly 


Val 


Thr 


Thr 


Ser 


Gly 


Asn 


Ser 


Leu 


Ser 


He 


Gly 


Phe 








100 










105 










110 






Val 


Thr 


Gin 


Ser 


•Ala 


Gin 


Lys 


Asn 


Val 


Gly 


Ala 


Arg 


Leu 


Tyr 


Leu 


Met 






115 










120 










125 








Ala 


Ser 


Asp 


Thr 


Thr 


Tyr 


Gin 


Glu 


Phe 


Thr 


Leu 


Leu 


Gly 


Asn 


Glu 


Phe 




.130 










135 










140 










Ser 


Phe 


Asp 


Val 


Asp 


Val 


Ser 


Gin 


Leu 


Pro 


Cys 


Gly 


Leu 


Asn 


Gly 


Ala 


145 










150 










155 










160 


Leu 


Tyr 


Phe 


Val 


Ser 


Met 


Asp 


Ala 


Asp 


Gly 


Gly 


Val 


Ser 


Lys 


Tyr 


Pro 










165 










170 










175 




Thr 


Asn 


Thr 


Ala 


Gly 


Ala 


Lys 


Tyr 


Gly 


Thr 


Gly 


Tyr 


Cys 


Asp 


Ser 


Gin 








180 










185 










190 






Cys 


Pro 


Arg 


Asp 


Leu 


Lvs 


Phe 


He 


Asn 


Gly 


Gin 


Ala 


Asn 


Val 


Glu 


Gly 






195 










200 










205 








Trp 


Glu 


Pro 


Ser 


Ser 


Asn 


Asn 


Ala 


Asn 


Thr 


Gly 


He 


Gly 


Gly 


His 


Gly 




210 










215 










220 










Ser 


Cys 


Cvs 
y ^ 


Ser 


Glu 


Met 


Asp 


He 


Trp 


Glu 


Ala 


Asn 


Ser 


He 


Ser 


Glu 


225 










230 










235 










240 


Ala 


Leu 


Thr 


Pro 


His 


Pro 


Cys 


Thr 


Thr 


Val 


Gly 


Gin 


Glu 


He 


Cys 


Glu 










245 










250 










255 




Gly 


Asp 


Gly 


Cys 


Gly 


Gly 


Thr 


Tyr 


Ser 


Asp 


Asn 


Arg 


Tyr 


Gly 


Gly 


Thr 








260 










265 










270 






Cys 


Asp 


Pro 


Asp 


Gly 


Cys 


Asp 


Trp 


Asn 


Pro 


Tyr 


Arg 


Leu 


Gly 


Asn 


Thr 






275 










280 










285 








Ser 


Phe 


Tvr 


Glv 


Pro 


Glv 
y 


Ser 


Ser 


Phe 


Thr 


Leu 


Asp 


Thr 


Thr 


Lvs 


LVS 




C» Zf U 










295 










300 










Leu 


Thr 


Val 


Val 


Thr 


Gin 


Phe 


Glu 


Thr 


Ser 


Gly 


Ala 


He 


Asn 


Arg 


Tyr 


305 










310 










315 










320 


Tyr 


Val 


Gin 


Asn 


Gly 


Val 


Thr 


Phe 


Gin 


Gin 


Pro 


Asn 


Ala 


Glu 


Leu 


Gly 










325 










330 










335 




Ser 


Tyr 


Ser 


Gly 


Asn 


Glu 


Leu 


Asn 


Asp 


Asp 


Tyr 


Cys 


Thr 


Ala 


Glu 


Glu 








340 










345 










350 






Ala 


Glu 


Phe 


Gly 


Gly 


Ser 


Ser 


Phe 


Ser 


Asp 


Lys 


Gly 


Gly 


Leu 


Thr 


Gin 






355 










360 










365 








Phe 


Lys 


Lys 


Ala 


Thr 


Ser 


Gly 


Gly 


Met 


Val 


Leu 


Val 


Met 


Ser 


Leu 


Trp 




370 










375 










380 










Asp 


Asp 


Tyr 


Tyr 


Ala 


Asn 


Met 


Leu 


Trp 


Leu 


Asp 


Ser 


Thr 


Tyr 


Pro 


Thr 


385 










390 










395 










400 


Asn 


Glu 


Thr 


Ser 


Ser 


Thr 


Pro 


Gly 


Ala 


Val 


Arg 


Gly 


Ser 


Cys 


Ser 


Thr 



405 



JCI 


Ser Gly 


Val 


Pro 


r\± d 


olll 


V OL± 








420 










vai 


1 XIX 




Ser 


Asn 


lie 


Lys 


Jriie 






A *3 
41 o D 










a a n 


Pro 


Ser 


biy 


Gly Asn 


Pro 


Pro 


biy 




/i n 
4 d u 










455 




lnr 


Arg 


Arg 


Pro 


Ala 


Thr 


Thr 


m*U -v- 

i nr 


A £ c. 
4 O D 










470 






Lys 


Arg 


7A 1 a 
/A ± d 


Gly Gly Gly Tyr 


Trp 










/IOC 

4od 








ASp 


Ala. 


Asn 


Asn 


val 


■ Pro 


val 


Arg 








500 










Pile 




i nr 


Cys 


Asn 


Tyr 


Val 


vai 






DID 










con 
D A U 


Arg 


Car 

oer 


Mo t- 


Leu 


Asp 


Gin 


lie 


Lys 




con 










C "5 (T 

53 5 




Leu 


Pro 


Tyr 


Ser 


Asp 


Asp 


He 


Leu 


j4 j 










550 






lie 


Asn 


irilfcr 


Tyr 


Gin 


Met 


Asn 


bin 










565 








bin 


val 


jxiec 


Asp 


Lys 


He 


Val 


Aia 








580 










lie 


lie 


Leu 


Asp 


Arg 


His 


Arg 


Pro 






d y d 










duu 


Trp 


Tyr 


i nr 


Ser 


Ser 


Val 


Ser 


blU 




bill 










615 




Ala 


Leu 


Ala 


Gin 


Arg 


Tyr 


Lys 


Gly 


DZD 










630 






Leu 


His 


Asn 


Glu 


Pro 


His 


Asp 


Pro 










645 








Ser 


He 


Asp 


Trp 


Arg 


Leu 


Ala 


Ala 








660 










Ser 


Val 


Asn 


Pro 


Asn 


Leu 


Leu 


lie 






675 










c o n 
Dot) 


Asn 


Gly Asp 


Ser 


Tyr 


Trp 


Trp 


Gly 




690 










695 




Tyr 


Pro 


Val- 


Val 


Leu 


Asn 


Val 


Pro 


n n c 

/ Ud 










710 






Asp 


Tyr 


Ala 


Thr 


Ser 


Val 


Tyr 


Pro 










725 








Phe 


Pro 


Asn 


Asn 


Met 


Pro Gly 


He 








740 










Phe 


Asn 


Gin 


Asn 


He 


Ala 


Pro 


Val 






755 










760 


Leu 


Gin 


Ser 


Thr 


Thr 


Asp 


Gin 


Thr 




770 










775 




Leu 


Arg 


Pro 


Thr Ala Gin Tyr Gly 


7.85 










790 






Trp 


Ser 


Trp 


Asn 


Pro 


Asp 


Ser Gly 










805 








Asp 


Trp 


Gin 


Thr 


Val 


Asp 


Thr 


Val 








820 










Lys 


Ser 


Ser 


He 


Phe 


Asp 


Pro 


Val 






835 










840 





410 










*± JL D 




Glu 


Ser 


Gin 


Ser 


Pro 


Asn 




Lys 


425 










430 






Gly 


Pro 


He Gly 


Ser 


Thr 


bxy 


Asn 










445 








Gly Asn 


Pro 


Pro 


Gly Thr 


Tnr 


Tnr 








460 










biy 


Ser 


Ser 


Pro 


Gly 


Pro 


i nr 


oexr 






475 










/l on 


HIS 


i nr 


Ser Gly Arg Glu 


lie 


Leu 




A Q fi 

4 y u 










/IOC 

4 y d 




lie 


Ala 
Aid 


Gly 


He 


Asn 


Trp 


irne 


biy 


enc 
DUD 










CI n 
Dl U 






MIS 


biy 


Leu 


Trp 


Ser 


Arg 


Asp 


Tyr 










525 








Ser 


Leu 


Gly 


Tyr 


Asn 


Thr 


lie 


Arg 








540 










Lys 


Pro 


Gly Thr Met 


Pro 


Asn 


Ser 






555 










c <r n 
Dot) 


Asp 


Leu 


Gin 


Gly Leu 


Thr 


Ser 


Leu 




CTA 

D / U 










CTC 

D / D 




Tyr 


Aia 


Gly 


Gin, 


He 


Gly 


Leu 


Arg 


coc 
Dob 










590 






Asp 


Cys 


Ser 


Gly Gin 


Ser 


Aia 


Leu 










605 








Ala 


Thr 


Trp 


He 


Ser 


Asp 


Leu 


bin 








620 










Asn 


Pro 


Thr 


Val 


Val 


Gly 


Pne 


Asp 






635 










C A A 


Ala 


Cys 


Trp 


Gly Cys 


Gly 


Asp 


Pro 




650 










ODD 




Glu 


Arg 


Ala Gly Asn 


Ala 


val 


Leu 


r r c 

665 










670 






Phe 


Val 


Glu 


Gly Val 


Gin 


Ser 


Tyr 










685 








Gly 


Asn 


Leu 


Gin 


Gly Ala 


biy 


bin 








700 










Asn 


Arg 


Leu 


Val 


Tyr 


Ser 


Ala 


His 






715 










ion 


bin 


Thr 


Trp 


Phe 


Ser 


Asp 


Pro 


i nr 




730 










/ J J 




Trp 


Asn 


Lys 


Asn 


Trp Gly 


Tyr 


Leu 


745 










750 






Trp 


Leu 


Gly 


Glu 


Phe 


Gly 


i nr 


Thr 










765 








Trp 


Leu 


Lys 


Thr 


Leu 


Val 


bin 


Tyr 








780 










Ala 


Asp 


Ser 


Phe 


Gin 


Trp 


Thr 


Phe 






795 










800 


Asp 


Thr 


Gly Gly 


He 


Leu 


Lys 


Asp 




810 










815 




Lys 


Asp 


Gly 


Tyr 


Leu 


Ala 


Pro 


He 



825 830 
Gly 



<210> 19 



<211> 10239 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> pTrex plasmid 
<400> 19 

aagcttaact agtacttctc gagctctgta catgtccggt cgcgacgtac gcgtatcgat 60 

ggcgccagct gcaggcggcc gcctgcagcc acttgcagtc ccgtggaatt ctcacggtga 120 

atgtaggcct tttgtagggt aggaattgtc actcaagcac ccccaacctc cattacgcct 180 

cccccataga gttcccaatc agtgagtcat ggcactgttc tcaaatagat tggggagaag 240 

ttgacttccg cccagagctg aaggtcgcac aaccgcatga tatagggtcg gcaacggcaa 3 00 

aaaagcacgt ggctcaccga aaagcaagat gtttgcgatc taacatccag gaacctggat 3 60 

acatccatca tcacgcacga ccactttgat ctgctggtaa actcgtattc gccctaaacc 420 

gaagtgacgt ggtaaatcta cacgtgggcc cctttcggta tactgcgtgt gtcttctcta 4 80 

ggtgccattc ttttcccttc ctctagtgtt gaattgtttg tgttggagtc cgagctgtaa 540 

ctacctctga atctctggag aatggtggac taacgactac cgtgcacctg catcatgtat 600 

ataatagtga tcctgagaag gggggtttgg agcaatgtgg gactttgatg gtcatcaaac 660 

aaagaacgaa gacgcctctt ttgcaaagtt ttgtttcggc tacggtgaag aactggatac 720 

ttgttgtgtc ttctgtgtat ttttgtggca acaagaggcc agagacaatc tattcaaaca 780 

ccaagcttgc tcttttgagc tacaagaacc tgtggggtat atatctagag ttgtgaagtc R4 0 

ggtaatcccg ctgtatagta atacgagtcg catctaaata ctccgaagct gctgcgaacc 900 

cggagaatcg agatgtgctg gaaagcttct agcgagcggc taaattagca tgaaaggcta 960 

■ tgagaaattc tggagacggc ttgttgaatc atggcgttcc attctbcgac aagcaaagcg 1020 

ttccgtcgca gtagcaggca ctcattcccg aaaaaactcg gagattccta agtagcgatg 1080 

gaaccggaat aatataatag gcaatacatt gagttgcctc gacggttgca atgcaggggt 1140 

actgagcttg gacataactg ttccgtaccc cacctcttct caacctttgg cgtttccctg 1200 

attcagcgta cccgtacaag tcgtaatcac tattaaccca gactgaccgg acgtgttttg 1260 

cccttcattt ggagaaataa tgtcattgcg atgtgtaatt tgcctgcttg accgactggg 1320 

gctgttcgaa gcccgaatgt aggattgtta tccgaactct gctcgtagag gcatgttgtg 1380 

aatctgtgtc gggcaggaca cgcctcgaag gttcacggca agggaaacca ccgatagcag 1440 

tgtctagtag caacctgtaa agccgcaatg cagcatcact ggaaaataca aaccaatggc 1500 

taaaagtaca taagttaatg cctaaagaag tcatatacca gcggctaata attgtacaat 1560 

caagtggcta aacgtaccgt aatttgccaa cggcttgtgg ggttgcagaa gcaacggcaa 162 0 
agccccactt ccccacgttt gtttcttcac tcagtccaat ctcagctggt gatcccccaa ' 1680 

ttgggtcgct tgtttgttcc ggtgaagtga aagaagacag aggtaagaat gtctgactcg 1740 

gagcgttttg catacaacca agggcagtga tggaagacag tgaaatgttg acattcaagg 1800 

agtatttagc cagggatgct tgagtgtatc gtgtaaggag gtttgtctgc cgatacgacg 1860 

aatactgtat agtcacttct gatgaagtgg tccatattga aatgtaagtc ggcactgaac 1920 

aggcaaaaga ttgagttgaa actgcctaag atctcgggcc ctcgggcctt cggcctttgg 1980 

gtgtacatgt ttgtgctccg ggcaaatgca aagtgtggta ggatcgaaca cactgctgcc 2040 

tttaccaagc agctgagggt atgtgatagg caaatgttca ggggccactg catggtttcg 2100 

aatagaaaga gaagcttagc caagaacaat agccgataaa gatagcctca ttaaacggaa 2160 

tgagctagta ggcaaagtca gcgaatgtgt atatataaag gttcgaggtc cgtgcctccc 2220 

tcatgctctc cccatctact catcaactca gatcctccag gagacttgta caccatcttt 2280 

tgaggcacag aaacccaata gtcaaccgcg gactgcgcat catgtatcgg aagttggccg 2340 

tcatctcggc cttcttggcc acagctcgtg ctcagtcggc ctgcactctc caatcggaga 2400 

ctcacccgcc tctgacatgg cagaaatgct cgtctggtgg cacttgcact caacagacag 2460 

gctccgtggt catcgacgcc aactggcgct ggactcacgc tacgaacagc agcacgaact 2520 

gctacgatgg caacacttgg agctcgaccc tatgtcctga caacgagacc tgcgcgaaga 2580 

actgctgtct ggacggtgcc gcctacgcgt ccacgtacgg agttaccacg agcggtaaca 2640 

gcctctccat tggctttgtc acccagtctg cgcagaagaa cgttggcgct cgcctttacc 2700 

ttatggcgag cgacacgacc taccaggaat tcaccctgct tggcaacgag ttctctttcg 2760 

atgttgatgt ttcgcagctg ccgtaagtga cttaccatca acccctgacg tatcttcttg 2820 

tgggctccca gctgactggc caatttaagg tgcggcttga acggagctct ctacttcgtg 2880 

tccatggacg cggatggtgg cgtgagcaag tatcccacca acaccgctgg cgccaagtac 2940 



ggcacggggt actgtgacag ccagtgtccc cgcgatctga agttcatcaa tggccaggcc 3 000 

aacgttgagg gctgggagcc gtcatccaac aacgcaaaca cgggcattgg aggacacgga 3060 

agctgctgct ctgagatgga tatctgggag gccaactcca tctccgaggc tcttaccccc 3120 

cacccttgca cgactgtcgg ccaggagatc tgcgagggtg atgggtgcgg cggaacttac 3180 

tccgataaca gatatggcgg cacttgcgat cccgatggct gcgactggaa cccataccgc 3240 

ctgggcaaca ccagcttcta cggccctggc tcaagcttta ccctcgatac caccaagaaa 3300 

ttgaccgttg tcacccagtt cgagacgtcg ggtgccatca accgatacta tgtccagaat 3360 

ggcgtcactt tccagcagcc caacgccgag cttggtagtt actctggcaa cgagctcaac 3420 

gatgattact gcacagctga ggaggcagaa ttcggcggat cctctttctc agacaagggc 3480 

ggcctgactc agttcaagaa ggctacctct ggcggcatgg ttctggtcat gagtctgtgg 3540 

gatgatgtga gtttgatgga caaacatgcg cgttgacaaa gagtcaagca gctgactgag 3600 
atgttacagt actacgccaa catgctgtgg ctggactcca cctacccgac aaacgagacc 1 3660 

tcctccacac ccggtgccgt gcgcggaagc tgctccacca gctccggtgt ccctgctcag 3720 

gtcgaatctc agtctcccaa cgccaaggtc accttctcca acatcaagtt cggacccatt 3780 

ggcagcaccg gcaaccctag cggcggcaac cctcccggcg gaaacccgcc tggcaccacc 3840 

accacccgcc gcccagccac taccactgga agctctcccg gacctactag taagcggata 3900 

aggcgcgccg cgcgccagct ccgtgcgaaa gcctgacgca ccggtagatt cttggtgagc 3960 

ccgtatcatg acggcggcgg gagctacatg gccccgggtg altttattttt tttgtatcta 4020 

cttctgaccc ttttcaaata .tacggtcaac tcatctttca ctggagatgc ggcctgcttg 4080 

gtattgcgat gttgtcagct tggcaaattg tggctttcga aaacacaaaa cgattcctta 4140 

gtagccatgc attttaagat aacggaatag aagaaagagg aaattaaaaa aaaaaaaaaa 4200 

acaaacatcc cgttcataac ccgtagaatc gccgctcttc gtg.tatccca gtaccagttt 4260 

attttgaata gctcgcccgc tggagagcat cctgaatgca agtaacaacc gtagaggctg 4320 

acacggcagg tgttgctagg gagcgtcgtg ttctacaagg ccagacgtct tcgcggttga 4380 

tatatatgta tgtttgactg caggctgctc agcgacgaca gtcaagttcg ccctcgctgc 4440 

ttgtgcaata atcgcagtgg ggaagccaca ccgtgactcc catctttcag taaagctctg 4500 

ttggtgttta tcagcaatac acgtaattta aactcgttag catggggctg atagcttaat 4560 

taccgtttac cagtgccgcg gttctgcagc tttccttggc ccgtaaaatt cggcgaagcc 4620 

agccaatcac cagctaggca ccagctaaac cctataatta gtctcttatc aacaccatcc 4680 

gctcccccgg gatcaatgag gagaatgagg gggatgcggg gctaaacaag cctacataac 4740 

cctcatgcca actcccagtt tacactcgtc gagccaacat cctgactata agctaacaca 4800 

gaatgcctca atcctgggaa gaactggccg ctgataagcg cgcccgcctc gcaaaaacca 4860 

tccctgatga atggaaagtc cagacgctgc ctgcggaaga cagcgttatt gatttcccaa 4920 

agaaatcggg gatcctttca gaggccgaac tgaagatcac agaggcctcc gctgcagatc 4980 

ttgtgtccaa gctggcggcc ggagagttga cctcggtgga agttacgcta gcattctgta 5040 

aacgggcagc aatcgcccag cagttagtag ggtcccctct acctctcagg gagatgtaac 5100 

aacgccacct tatgggacta tcaagctgac gctggcttct gtgcagacaa actgcgccca 5160 

cgagttcttc cctgacgccg ctctcgcgca ggcaagggaa ctcgatgaat actacgcaaa 522 0 

gcacaagaga cccgttggtc cactccatgg cctccccatc tctctcaaag accagcttcg 5280 

agtcaaggta caccgttgcc cctaagtcgt tagatgtccc tttttgtcag ctaacatatg 5340 

ccaccagggc tacgaaacat caatgggcta catctcatgg ctaaacaagt acgacgaagg 5400 

ggactcggtt ctgacaacca tgctccgcaa agccggtgcc gtcttctacg tcaagacctc 5460 

tgtcccgcag accctgatgg tctgcgagac agtcaacaac atcatcgggc gcaccgtcaa 5520 

cccacgcaac aagaactggt cgtgcggcgg cagttctggt ggtgagggtg cgatcgttgg 5580 

gattcgtggt ggcgtcatcg gtgtaggaac ggatatcggt ggctcgattc gagtgccggc 5640 

cgcgttcaac ttcctgtacg gtctaaggcc gagtcatggg cggctgccgt atgcaaagat 5700 

ggcgaacagc atggagggtc aggagacggt gcacagcgtt gtcgggccga ttacgcactc 5760 

tgttgagggt gagtccttcg cctcttcctt cttttcctgc tctataccag gcctccactg 5820 

tcctcctttc ttgcttttta tactatatac gagaccggca gtcactgatg aagtatgtta 5880 

gacctccgcc tcttcaccaa atccgtcctc ggtcaggagc catggaaata cgactccaag 5940 

gtcatcccca tgccctggcg ccagtccgag tcggacatta ttgcctccaa gatcaagaac 6000 

ggcgggctca atatcggcta ctacaacttc gacggcaatg tccttccaca ccctcctatc 6060 

ctgcgcggcg tggaaaccac cgtcgccgca ctcgccaaag ccggtcacac cgtgaccccg 6120 

tggacgccat acaagcacga tttcggccac gatctcatct cccatatcta cgcggctgac 6180 

ggcagcgccg acgtaatgcg cgatatcagt gcatccggcg agccggcgat tccaaatatc 6240 

aaagacctac tgaacccgaa catcaaagct gttaacatga acgagctctg ggacacgcat 6300 

ctccagaagt ggaattacca gatggagtac cttgagaaat ggcgggaggc tgaagaaaag 63 60 



I 



gccgggaagg aactggacgc catcatcgcg ccgattacgc ctaccgctgc ggtacggcat 6420 

gaccagttcc ggtactatgg gtatgcctct gtgatcaacc tgctggattt cacgagcgtg 6480 

gttgttccgg ttacctttgc ggataagaac atcgataaga agaatgagag tttcaaggcg 6540 

gttagtgagc ttgatgccct cgtgcaggaa gagtatgatc cggaggcgta ccatggggca 6600 

ccggttgcag tgcaggttat cggacggaga ctcagtgaag agaggacgtt ggcgattgca 6660 

gaggaagtgg ggaagttgct gggaaatgtg gtgactccat agctaataag tgtcagatag 6720 

caatttgcac aagaaatcaa taccagcaac tgtaaataag cgctgaagtg accatgccat 6780 

gctacgaaag agcagaaaaa aacctgccgt agaaccgaag agatatgaca cgcttccatc 6840 

tctcaaagga agaatccctt cagggttgcg tttccagtct agacacgtat aacggcacaa 6900 

gtgtctctca ccaaatgggt tatatctcaa atgtgatcta aggatggaaa gcccagaatc 6960 

taggcctatt aatattccgg agtatacgta gccggctaac gttaacaacc ggtacctcta 7020 

gaactatagc tagcatgcgc aaatttaaag cgctgatatc gatcgcgcgc agatccatat 7 080 

atagggcccg ggttataatt acctcaggtc gacgtcccat ggccattcga attcgtaatc 7140 

atggtcatag ctgtttcctg tgtgaaattg ttatccgctc acaattccac acaacatacg 7200 

agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac tcacattaat 7260 

tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg 7320 

aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 7380 
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ' 7440 

ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 7 500 

ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg 7560 

•cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg 7 62 0 

actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac 7680 

cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca 7740 

tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt 7800 

.gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaaotatc gtcttgagtc 7860 

caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag 7920 

agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 7 980 

tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt 8040 

tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa 8100 

gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg 8160 

gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa 8220 

aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 8280 

atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc 8340 

gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat 8400 

acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc 84 60 
ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc ' 8520 

tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag 8580 

ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg 8640 

ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg 8700 

atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag 8760 

taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt 8820 

catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga 8880 

atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc 8940 

acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc 9000 

aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc 9060 

ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc 9120 

cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca 9180 

atattattga agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat 9240 

ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt 9300 

ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca cgaggccctt 9360 

tcgtctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 9420 

ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc 9480 

gggtgttggc gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag 9540 

agtgcaccat aaaattgtaa acgttaatat tttgttaaaa ttcgcgttaa atttttgtta 9600 

aatcagctca ttttttaacc aataggccga aatcggcaaa atcccttata aatcaaaaga 9660 

atagcccgag atagggttga gtgttgttcc agtttggaac aagagtccac tattaaagaa 9720 

cgtggactcc aacgtcaaag ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga 9780 



accatcaccc aaatcaagtt ttttggggtc gaggtgccgt aaagcactaa atcggaaccc 9840 

taaagggagc ccccgattta gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga 9900 

agggaagaaa gcgaaaggag cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg 9960 

cgtaaccacc acacccgccg cgcttaatgc gccgctacag ggcgcgtact atggttgctt 10020 

tgacgtatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg catcaggcgc 10080 

cattcgccat tcaggctgcg caactgttgg gaagggcgat cggtgcgggc ctcttcgcta 10140 

ttacgccagc tggcgaaagg gggatgtgct gcaaggcgat taagttgggt aacgccaggg 10200 

ttttcccagt cacgacgttg taaaacgacg gccagtgcc 10239 

<210> 20 

<211> 12 

<212> DNA 1 

<213> Trichoderma reesei 

<400> 20 

actagtaagc gg 12 



<210> 21 

<211> 41 

<212> DNA 

<213> Artificial Sequence 

<220> , ' 

<223> primer 

<400> 21 

gcttatacta gtaagcgcgc gggcggcggc tattggcaca c 41 



<210> 22 

<211> 39 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> primer 



<400> 22 

gcttatggcg .cgccttagac aggatcgaaa atcgacgac 39 

<210> 23 

<211> 43 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 23 

ctaagagaaa cgacccgtac atccagcggt tcctcacgat gta 43 

<210> 24 

<211> 38 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



primer 



t 



<400> 24 

ttacccggat gggaagagca tgccaaaatc ggcgttcg 

<210> 25 

<211> 43 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 25 

ctaagagagc gacgactcag ccgtacacct ggagcaacgt ggc 

<210> 26 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 26 

ttacgatccg gacggcgcac caccaatgtc cccgtata 

<210> 27 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 27 

gcttatacta gtaagcgcgc cggctgctcg gtggactaca eg 

<210> 28 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 28 

gcttatggcg cgccttacag aggegggtag gcgttgg 

<210> 29 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 29 



gcttatacta gtaagcgcgc cggtctcacc gccacagtca 



<210> 30 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 30 

gcttatggcg cgcctcagga ctggagcttg ctccgc 
21 



